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(57) Abstract: First generation adenoviral vectors and associated recombinant adenovirus-based HIV vaccines which show en- 
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novirdi vectors are utilized to generate and produce through cell culture various adenoviral -based HIV-1 vaccines which contain 
HIV-1 gag, mV-l pol and/or HIV-1 nef polynucleotide phaTmaceulical products, and biologically relevant modifications thereof, 
llicsc adenovirus vaccines, when directly introduced into living vertebrate tissue, preferably a mammalian host such as a human or a 
non-human mammal of commercial or domestic veterinary importance, express the HTVl- Gag, Pol and/or Nefprotein or biologically 
modification thereof, inducing a cellular immune response which specifically recognizes HTV-L The exemplified polynucleotides 
of the present invention are synthetic DNA molecules encoding HlV-1 Gag, encoding codon optimized HIV-1 Pol, derivatives of 
optimized HIV-l Pol (including constructs wherein protease, reverse transcriptase. RNAse H and integrase activity of HTV-l Pol 
is inactivated). HlV-1 Nef and derivatives of optimized HIV-1 Nef. including nef mutants which effect wild type characteristics of 
Nef, such as myristylation and down regulation of host CD4. The adenoviral vaccines of the present invention, when administered 
alone or in a combined modality regime, will offer a prophylactic advantage to previously uninfected individuals and/or provide 
a therapeutic effect by reducing viral load levels within an infected individual, thus prolonging the asymptomatic phase of HIV-1 
infection. 
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miJE OF THE INVENTION 

ENHANCED FIRST GENERATION ADENOVIRUS VACCINES EXPRESSING 
CODON OPTIMIZED IHVl-GAG, POL, NEF AND MODIFICATIONS 



5 CROSS-REFERENCE TO RELATED APPUCATIONS 

This application claims the benefit, under 35 U.S.C. §119(e), of U.S. 
provisional applications 60/233,180, 60/279,056, and Attorney Docket 20867PV2 
(serial number unassigned), filed September 15, 2000, March 27, 2001, and 
September 7, 2001, respectively. 

10 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not Applicable 



REFERENCE TO MICROHCHE APPENDIX 
15 Not Applicable 

'TELD OF THE INVENTION 

The present invention relates to recombinant, replication-deficient first 
generation adenovirus vaccines found to exhibit enhanced growth properties and 

20 greater cellular-mediated immunity as compared to other replication-deficient vectors. 
The invention also relates to the associated first generation adenoviral vectors 
described herein, which, through the incorporation of additional 5' adenovirus 
. sequence, enhance large scale production efficiency of the recombinant, repUcation- 
defective adenovirus described herein. Another aspect of the instant invention is the 

25 surprising discovery that the intron A portion of the human cytomegalovirus (hCMV) 
promoter constitutes a region of instability in adenoviral vector constructs. Removal 
of this region from adenoviral expression constructs results in greatiy improved vector 
stability. Therefore, improved vectors expressing a transgene under the control of an 
intron A-deleted CMV promoter constitute a further aspect of this invention. These 

30 adenoviral vectors are useful for generating recombinant adenovirus vaccines against 
human immunodeficiency virus (HTV). In particular, the first generation adenovirus 
vectors disclosed herein are utilized to construct and generate adenovirus-based HTV- 
1 vaccines which contain HIV-1 Gag, HIV-l Pol and/or HIV-1 Nef polynucleotide 
pharmaceutical products, and biologically active modifications thereof. Host 

35 administration of the recombinant, replication-deficient adenovirus vaccines described 
herein results in expression of HIV-1 Gag, HIV-l- Pol and/or Nef protein or 
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immunologically relevant modifications thereof, inducing a cellular immune response 
which specifically recognizes HTV-L Tho exemplified polynucleotides of the present 
invention are synthetic DNA molecules encoding codon optimized HIV-1 Gag, HIV-1 
Pol, derivatives of optimized HTV-l Pol (including constructs wherein protease, 
5 reverse transcriptase, RNAse H and integrase activity of HTV-l Pol is inactivated), 
HIV-l Nef, and derivatives of optimized HTV-l Nef, including nef mutants which 
effect wild type characteristics of Nef, such as myristylation and down regulation of 
host CD4. The HTV adenovirus vaccines of the present invention, when administered 
alone or in a combined modaUty and/or prime/boost regimen, will offer a prophylactic 
10 advantage to previously uninfected individuals and/or provide a therapeutic effect by 
reducing viral load levels within an infected individual, thus prolonging the 
asymptomatic phase of HIV-1 infection. 

BACKGROUND OF THE INVENTION 

15 Human Immunodeficiency Virus-1 (HIV-1) is the etiological agent of 

acquired human immune deficiency syndrome (AIDS) and related disorders. HIV-1 

an RNA virus of the Retroviridae family and exhibits the S'VTR-gag-pol-env- 
LTR 3 'organization of all retroviruses. The integrated form of HTV-l, known as the 
provirus, is approximately 9.8 Kb in length. Each end of the viral genome contains 

20 flanking sequences known as long terminal repeats (LTRs). The HTV genes encode at 
least nine proteins and are divided into three classes; the major structural proteins 
(Gag, Pol, and Env), the regulatory proteins (Tat and Rev); and the accessory proteins 
(Vpu, Vpr,VifandNef). 

The gag gene encodes a 55-kilodalton (kDa) precursor protein (p55) which is 

25 expressed from the unspliced viral mRNA and is proteolytically processed by the HTV 
protease, a product of the pol gene. The mature p55 protein products are pl7 
(matrix), p24 (capsid), p9 (nucleocapsid) and p6. 

The pol gene encodes proteins necessary for virus replication; a reverse 
transcriptase, a protease, integrase and RNAse H. These viral proteins are expressed 

30 as a Gag-Pol fusion protein, a 160 kDa precursor protein which is generated via a 
ribosomal frame shifting. The viral encoded protease proteolytically cleaves the Pol 
polypeptide away from the Gag-Pol fusion and further cleaves the Pol polypeptide to 
the mature proteins which provide protease (Pro, PIO), reverse transcriptase (RT, 
P50). integrase (IN, p31) and RNAse H (RNAse, pl5) activities. 
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The nefgtnc encodes an early accessory HIV protein (Nef) which has been 
shown to possess several activities such as down regulating CD4 expression, 
disturbing T-cell activation and stimulating HIV infectivity. 

The env gene encodes the viral envelope glycoprotein tiiat is translated as a 
S 160-kilodalton (kDa) precursor (gpl60) and dien cleaved by a cellular protease to 
yield the external 120-kDa envelope glycoprotein (gpl20) and the transmembrane 41- 
kDa envelope glycoprotein (gp41). Gpl20 and gp41 remain associated and are 
displayed on the viral particles and the surface of HIV-infected cells. 

The tat gene encodes a long form and a short form of the Tat protein, a RNA 
10 binding protein which is a transcriptional transactivator essential for HTV-l 
replication. 

The rev gene encodes the 13 kDa Rev protein, a RNA binding protein. The 
Rev protein binds to a region of the viral RNA termed the Rev response element 
(RRE). The Rev protein promotes transfer of unspliced viral RNA from the nucleus 

15 to the cytoplasm. The Rev protein is required for HIV late gene expression and in 
turn, HIV replication. 

Gpl20 binds to the CD4/chemokine receptor present on the surface of helper 
T-lymphocytes, macrophages and other target cells in addition to other co-receptor 
molecules. X4 (macrophage tropic) virus show tropism for CD4/CXCR4 complexes 

20 while a R5 (T-cell line tropic) vims interacts with a CD4/CCR5 receptor complex. 
After gpl20 binds to CD4, gp41 mediates the fusion event responsible for virus entry. 
The virus fuses with and enters the target cell, followed by reverse transcription of its 
single stranded RNA genome into the double-stranded DNA via a RNA dependent 
DNA polymerase. The viral DNA, known as pro virus, enters the cell nucleus, where 

25 the viral DNA directs the production of new viral RNA within the nucleus, expression 
of early and late HIV viral proteins, and subsequentiy the production and cellular 
release of new virus particles. Recent advances in the ability to detect viral load 
within the host shows that the primary infection results in an extremely high 
generation and tissue distribution of the virus, followed by a steady state level of vims 

30 (albeit through a continual viral production and turnover during this phase), leading 
ultimately to another burst of virus load which leads to the onset of clinical AIDS. 
Productively infected cells have a half life of several days, whereas chronically or 
latentiy infected cells have a 3 -week half life, followed by non-productively infected 
cells which have a long half life (over 100 days) but do not significantiy contribute to 

35 day to day viral loads seen throughout the course of disease. 
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Destruction of CD4 helper T lymphocytes, which are critical to immune 
defense, is a major cause of the progressive immune dysfunction that is the hallmark 
of HIV infection. The loss of CD4 T-cells seriously impairs the body's ability to fight 
most invaders, but it has a particularly severe impact on the defenses against viruses, 
5 fungi, parasites and certain bacteria, including mycobacteria. 

Effective treatment regimens for HIV-1 infected individuals have become 
available recently. However, these drugs will not have a significant impact on the 
disease in many parts of the world and they will have a minimal impact in halting the 
spread of infection within the human population. As is true of many other infectious 

10 diseases, a significant epidemiologic impact on the spread of HTV-l infection will 
only occur subsequent to the development and introduction of ah effective vaccine. 
There are a number of factors that have contributed to the lack of successful vaccine 
development to date. As noted above, it is now apparent that in a chronically infected 
person there exists constant virus production in spite of the presence of anti-HTV-l 

15 humoral and cellular immune responses and destruction of virally infected cells. As. 
in the case of other infectious diseases, the outcome of disease is the result of a 
balance between the kinetics and the magnitude of the immune response and the 
pathogen replicative rate and accessibility to the immune response. Pre-existing 
immunity may be more successful with an acute infection than an evolving immune 

20 response can be with an established infection. A second factor is the considerable 
genetic variability of the virus. Although anti-HIV-1 antibodies exist that can 
neutralize HIV-1 infectivity in cell culture, these antibodies are generally virus 
isolate-specific in their activity. It has proven impossible to define serological 
groupings of HIV-1 using traditional methods. Rather, the virus seems to define a 

25 serological "continuum" so that individual neutralizing antibody responses, at best, 
are effective against only a handful of viral variants. Given this latter observation, it 
would be useful to identify inmiunogens and related delivery technologies that are 
likely to elicit anti-HTV-l cellular inamune responses. It is known that in order to 
generate CTL responses antigen must be synthesized within or introduced into cells, 

30 subsequently processed into small peptides by the proteasome complex, and 

translocated into the endoplasmic reticulum/Gplgi complex secretory pathway for 
eventual association with major histocompatibility complex (MHC) class I proteins. 
CDS'*" T lymphocytes recognize antigen in association with class I MHC via the T cell 
receptor (TCR) and the CDS cell surface protein. Activation of naive CDS'*" T cells 

35 into activated effector or memory cells generally requires both TCR engagement of 
antigen as described above as well as engagement of costimulatory proteins. Optimal 
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induction of CTL responses usually requires "help" in the fonn of cytokines from 
CD4^ T lymphocytes which recognize antigen associated with MHC class 11 
molecules via TCR and CD4 engagement. 

European Patent Applications 0 638 316 (Published February 15, 1995) and 0 
5 586 076 (Published March 9, 1994), (both assigned to American Home Products 
Corporation) describe replicating adenovirus vectors carrying an HIV gene, including 
env or gag. Various treatment regimens were used with chimpanzees and dogs, some 
of which included booster adenovirus or protein plus alum treatments. 

Replication-defective adenoviral vectors harboring deletions in the El region 
10 are known, and recent adenoviral vectors have incorporated the known packaging 
repeats into these vectors; e.g., see EP 0 707 071, disclosing, inter alia, an adenoviral 
vector deleted of El sequences from base pairs 459 to 3328; and U.S. Patent No. 
6,033,908, disclosing, inter alia, an adenoviral vector deleted of base pairs 459-3510. 
The packaging efficiency of adenovirus has been taught to depend on the number of 
15 incorporated individual A (packaging) repeats; see^ e.g., Grable and Hearing, 1990 7. 
Virol 64(5):2047-2056; Grable and Hearing, 19927. Virol 66(2):723-731. 

Larder, et al., (1987, Nature 327: 716-717) and Larder, et al., (1989, Proc. 
Natl Acad. ScL 86: 4803-4807) disclose site specific mutagenesis of HTV-l RT and 
the effect such changes have on in vitro activity and infectivity related to interaction 
20 with known inhibitors of RT. 

Davies, et al. (1991, Science 252:, 88-95) disclose the crystal structure of the 
RNase H domain of HIV-1 Pol. 

Schatz, et al. (1989, FEES Lett. 257: 311-314) disclose that mutations 
Glu478Gln and His539Phe in a complete HIV-1 RT/RNase H DNA fragment results 
25 in defective RNase activity without effecting RT activity. 

Mizrahi, et al. (1990, Nucl Acids. Res. 18: pp. 5359-5353) disclose additional 
mutations Asp443Asn and Asp498 Asn in the RNase region of the pol gene which also 
results in defective RNase activity. The authors note that the Asp498Asn mutant was 
difficult to characterize due to instability of this mutant protein. 
30 , Leavitt, et al. (1993, J. Biol Chem. 268: 2113-2119) disclose several 

mutations, including a Asp64Val mutation, which show .differing effect on HTV-l 
integrase (IN) activity. 

Wiskerchen, et al. (1995, 7. Virol 69: 376-386) disclose singe and double 
mutants, including mutation of aspartic acid residues which effect HIV-1 IN and viral 
35 replication functions. 
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It would be of great import in the battle against AIDS to produce a 
prophylactic- and/or therapeutic-based HIV vaccine which generates a strong cellular 
immune response against an HIV infection. The present invention addresses and 
meets these needs by disclosing a class of adenovirus vaccines which, upon host 
5 administration, express codon optimized and modified versions of the HIV-1 genes, 
gag, pol and nef. These recombinant, replication-defective adenovirus vaccines may 
be administered to a host, such as a human, alone or as part of a combined modahty 
regimen and/or prime-boost vaccination regimen with components of the present 
• invention and/or a distinct viral HIV DNA vaccine, non-viral HTV DNA vaccine, HIV 
10 subunit vaccine, an HIV whole killed vaccine and/or a live attenuated HTV vaccine. 

SXJMMARY OF THE INVENTION 

The present invention relates to enhanced replication-defective recombinant 
adenovirus vaccine vectors and associated recombinant, replication-dejBcient 

15 adenovirus vaccines which encode various forms of HIV-1 Gag, HIV-1 Pol, and/or 
HIV-1 Nef, including immunologically relevant modifications of HIV-1 Gag, HIV-1 
Pol and HIV-1 Nef. The adenovirus vaccines of the present invention express HTV 
antigens and provide for improved cellular-mediated immune responses upon host 
administration. Potential vaccinees include but are not limited to primates and 

20 especially humans and non-human primates, and also include any non-human 
mammal of commercial or domestic veterinary importance. An effect of the 
improved recombinant adenovirus-based vaccines of the present invention should be a 
lower transmission rate to previously uninfected individuals (i.e., prophylactic 
applications) and/or reduction in the levels of the viral loads within an infected 

25 individual (i.e., therapeutic applications), so as to prolong the asymptomatic phase of 
HIV-1 infection. In particular, the present invention relates to adenoviral-based 
vaccines which encode various forms of codon optimized HIV-1 Gag (including but 
in no way limited to p55 versions of codon optimized full length (FL) Gag and tPA- 
Gag fusion proteins), HIV-1 Pol, HIV-l Nef, and selected modifications of 

30 immunological relevance. The administration, intracellular delivery and expression 
of these adenovirus vaccines elicit a host CTL and Th response. The preferred 
replication-defective recombinant adenoviral vaccine vectors include but are not 
limited to synthetic DNA molecules which (1) encode codon optimized versions of 
wild type HTV-l Gag; (2) encode codon optimized versions of HIV-l Pol; (3) encode 

35 codon optimized versions of HTV-l Pol fusion proteins; (4) encode codon optimized 
versions of modified HTV-l Pol proteins and fusion proteins, including but not liniited 
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to pol modifications involving residues within the catalytic regions responsible for 
RT, RNase and IN activity within the host cell; (5) encode codon optimized versions 
of wild type HTV-l Nef; (6) codon optimized versions of HIV-1 Nef fusion proteins; 
and/or (7) codon optimized versions of HIV-1 Nef derivatives, including but not 

5 limited to ne/ modifications involving introduction of an amino-terminal leader 
sequence, removal of an amino-terminal myristylation site and/or introduction of 
dileucine motif mutations. The Nef-based fusion and modified proteins, disclosed 
within this specification and expressed from an adenoviral-based vector vaccine this 
specification, may possess altered trafficking and/or host cell function while retaining 

10 the ability to be properly presented to the host MHC I complex and in turn elicit a 
host CTL and Th response. Examples of KV-l Gag, Pol and/or Nef fusion proteins 
include but are not limited to fusion of a leader or signal peptide at the NH2-teriminal 
portion of the viral antigen coding region. Such a leader peptide includes but is not 
limited to a tPA leader peptide. 

15 The adenoviral vector utilized in construction of the HIV-1 Gag-, HIV-1 Pol- 

and/or HTV-l Nef- based vaccines of the present invention may comprise any 
.^i>lication-defective adenoviral vector which provides for enhanced genetic stability 
of the recombinant adenoviral genome through large scale production and purification 
of the recombinant virus. In other words, an HIV-1 Gag-, Pol- or Nef-based 

20 adenovirus vaccine of the present invention is a purified recombinant, repHcation- 
defective adenovirus which is shown to be genetically stable through multiple 
passages in cell culture and remains so during large scale production and purification 
procedures. Such a recombinant adenovirus vector and harvested adenovirus vaccine 
lends itself to large scale dose filling and subsequent worldwide distribution 

25 procedures which will be demanded of an efficacious monovalent or multivalent HTV 
vaccine. The present invention meets this basic requirement with description of a 
replication-defective adenoviral vector and vectors derived therefrom, at least 
partially deleted in El, comprising a wildtype adenovirus cw-acting packaging region 
from about base pair 1 to between from about base pair 342 (more preferably, 400) to 

30 about base pair 458 of the wildtype adenovirus genome. A preferred embodiment of 
the instant invention comprises base pairs 1-450 of a wildtype adenovirus. In other 
preferred embodiments, the rephcation -defective adenoviral vector has, in addition 
thereto, a region 3' to the El-deleted region comprising base pairs 3511-3523. 
Basepairs 342-450 (more particularly, 400-450) constitute an extension of the 

35 5'region of previously disclosed vectors carrying viral antigens, particularly HTV 
antigens (see, e.g., PCT International Application PCTAJSOO/18332, published 
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January 11, 2001 (WO 01/02067), which claims priority to U.S. Provisional 
Application Serial Nos. 60/142,631 and 60/148,981, filed 7/6/1999 and 8/13/1999, 
respectively; these documents herein incorporated by reference. Applicants have 
found that extending the 5* region further into the El gene into the disclosed vaccine 
5 vectors incorporated elements found to be important in optimizing the packaging of 
the virus. 

As compared to previous vectors not comprising basepairs from about 1 to 
between from about base pair 342 (more preferably, 400) to about base pair 458 of the 
wildtype adenovirus genome, vectors comprising the above region exhibited enhanced 

10 growth characteristics, with approximately 5-10 fold greater amplification rates, a 
more potent virus effect, allowing lower doses of virus to be used to generate 
equivalent immunity; and a greater cellular-mediated immune response than 
replication-deficient vectors not comprising this region (basepairs 1-450). Even more 
important, adenoviral constructs derived therefrom are very stable genetically in 

15 large-scale production, particularly those comprising an expression cassette under the 
control of a hCMV promoter devoid of intron A. This is because AppUcants have 
• surprisingly found that the intron A portion of the hCMV promoter constituted a 
region of instability when employed in adenoviral vectors. Applicants have, 
therefore, identified an enhanced adenoviral vector which is particularly suited for use 

20 in gene therapy and nucleotide-based vaccine- vectors which, favorably, lends itself to 
large scale propagation. 

A preferred embodiment of this invention is a replication-defective adenoviral 
vector in accordance with the above description wherein the gene is inserted in the 
form of a gene expression cassette comprising (a) a nucleic acid encoding a protein or 

25 biologically active and/or immunologically relevant portion thereof; (b) a 

heterologous promoter operatively linked to the nucleic acid of part a); and, (c) a 
transcription terminator. 

In preferred embodiments, the El gene, other than that contained within 
basepairs 1-450 or, alternatively, that contained within base pairs 1-450 and 3511- 

30 3523 has been deleted from the adenoviral vector, and the gene expression cassette 
has replaced the deleted El gene. In other preferred embodiments, the replication 
defective adenovirus genome does not have a functional E3 gene, or the E3 gene has 
been deleted. Most preferably, the E3 region is present within the adenoviral genome. 
Further preferred embodiments are wherein the gene expression cassette is in an El 

35 anti-parallel (transcribed in a 3' to 5' direction relative to the vector backbone) 
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orientation or, more preferably, an El parallel (transcribed in a 5' to 3' direction 
relative to the vector backbone) orientation. 

Further embodiments relate to a shuttle plasmid vector comprising: an 
adenoviral portion and a plasmid portion, wherein said adenovirus portion comprises: 
5 a) a replication defective adenovirus genome, at least partially deleted in El, 

comprising a wildtype adenovirus cis-BCting packaging region from about base pair 1 
to between from about base pair 342 (more preferably, 400) to about base pair 458 
(preferably, 1-450) of the wildtype adenovirus genome and, preferably, in addition 
thereto, basepairs 3511-3523 of a wildtype adenovirus sequence; and b) a gene 

10 expression cassette comprising: (a) a nucleic acid encoding a protein or biologically 
active and/or immunologically relevant portion thereof; (b) a heterologous promoter 
operatively linked to the nucleic acid of part a);and (c) a transcription terminator 
and/or a polyadenylation site. 

Other aspects of this invention include a host cell comprising said adenoviral 

15 vectons and/or said shuttle plasmid vectors; vaccine compositions comprising said 
vectors; and methods of producing the vectors comprising (a) introducing the 
adenoviral vector into a host cell which expresses adenoviral El protein, and (b) 
harvesting the resultant adenoviral vectors. 

To this end, the present invention particularly relates to harvested 

20 recombinant, replication defective virus derived from a host cell, such as but not 
limited to 293 cells or PER.C6® cells, including but not limited to harvested virus 
related to any of the MRKAd5 vector backbones, with or without an accompanying 
transgene, including but not limited to the HIV-l antigens described herein. An 
HTV-l vaccine is represented by any harvested, recombinant adenovirus material 

25 which expresses any one or more of the HTV-l antigens disclosed herein. This 
harvested material may then be purified, formulated and stored prior to host 
administration. 

Another aspect of this invention is a method of generating a cellular immune 
response against a protein in an individual comprising administering to the individual 
30 an adenovirus vaccine vector comprising: 

a) a recombinant, replication defective adenoviral vector, at least partially 
deleted in El, comprising a wildtype adenovirus cw-acting adenovirus packaging 
region from about base pair 1 to between from about base pair 342 (more preferably, 
400) to about base pair 458 (preferably, 1-450) and, preferably in addition thereto, 
35 base pairs 3511-3523 of a wildtype adenovirus sequence, and. 
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b) a gene expression cassette comprising:© a nucleic acid encoding a protein 
or biologically active and/or immunologically relevant portion thereof; (ii) a 
heterologous promoter operatively linked to the nucleic acid of part a); and (iii) a 
transcription terminator and/or a polyadenylation site. 

5 In view of the efficacious nature of the adenoviral and/or DNA plasmid 

vaccines described herein, the present invention relates to all methodology regarding 
administration of one or more of these adenoviral and/or DNA plasmid vaccines to 
provide effective immunoprophylaxis, to prevent establishment of an HTV-l infection 
following exposure to this virus, or as a post-HIV infection therapeutic vaccine to 

10 mitigate the acute HIV-1 infection so as to result in the establishment of a lower virus 
load with beneficial long tenn consequences. As discussed herein, such a treatment 
regimen may include a monovalent or multivalent composition, various combined 
modality applications, and/or a primeA}Oost regimen to as to optimize antigen 
expression and a concomitant cellular-mediated and/or humoral inoonune response 

15 upon inoculation into a living vertebrate tissue. Therefore, the present invention 

provides for methods of using the adenoviral and/or DNA plasmid vaccines disclosed 
herein within the various parameters disclosed herein as well as any additional 
parameters known in the art, which, upon introduction into mammalian tissue induces 
intracellular expression of the gag, pol and/or nef-based vaccines. 

20 To this end, the present invention relates in part to methods of generating a 

cellular inomune response in a vaccinee, preferably a human vaccinee, wherein the 
individual is given more than one administration of adenovirus vaccine vector, and it 
may be given in a regimen accompanied by the administration of a plasmid vaccine. 
The plasmid vaccine (also referred to herein as a 'DNA plasmid vaccine" or "vaccine 

25 plasmid" comprises a nucleic acid encoding a protein or an immunologically relevant 
portion thereof, a heterologous promoter operably linked to the nucleic acid sequence, 
and a transcription terminator or a polyadenylation signal (such as bGH or SPA, 
respectively). There may be a predetermined minimum amount of time separating the 
administrations. The individual can be given a first dose of plasmid vaccine, and then 

30 a second dose of plasmid vaccine. Altematively, the individual may be given a first 
dose of adenovirus vaccine, and then a second dose of adenovirus vaccine. In other 
embodiments, the plasmid vaccine is administered first, followed after a time by 
administration of the adenovirus vaccine. Conversely, the adenovirus vaccine maybe 
administered first, followed by administration of plasmid vaccine after a time. In 

35 these embodiments, an individual may be given multiple doses of the same 

adenovirus serotype in either viral vector or plasmid form, or the virus may be of • 
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differing serotypes. In the alternative, a viral antigen of interest can be first delivered 
via a.viral vaccine other than an adenovirus-based vaccine, and then followed with the 
adenoviral vaccine disclosed. Alternative viral vaccines include but are not Hmited to 
pox virus and Venezuelan equine encephilitis virus. 
5 The present invention also relates to multivalent adenovirus vaccine 

compositions which comprise Gag, Pol and Nef components described herein; see, 
e.g.. Example 29 and Table 25. Such compositions will provide for an enhanced 
cellular immune response subsequent to host administration, particularly given the 
genetic diversity of human MHCs and of circulating virus. Examples, but not 

10 limitations, include MRICAdS-vector based multivalent vaccine compositions which 
provide for a divalent (i.e., gag and nef, gag and pol, or pol and nef components) or a 
trivalent vaccine (i.e., gag, pol and nef components) composition. Such a mutlivalent 
vaccine may be filled for a single dose or may consist of multiple inoculations of each ' 
individually filled component; and may in addition be part of a prime/boost regimen 

15 with viral or non-viral vector vaccines as introduced in the previous paragraph. To 
this end, preferred compositions are MRKAdS adenovirus used in combination with 
TDTiltiple, distinct BDIV antigen classes. Each HTV antigen class is subject to sequence 
manipulation, thus providing for a multitude of potential vaccine combinations; and 
such combinations are within the scope of the present invention. The utilization of 

20 such combined modalities vaccine formulation and administration increase the 

probability of eliciting an even more potent cellular immune response when compared 
to inoculation with a single modality regimen. 

The concept of a "combined modality" as disclosed herein also covers the 
alternative mode of administration whereby multiple HIV-1 viral antigens may be 

25 ligated into a proper shuttle plasmid for generation of a pre-adenoviral plasmid 
comprising multiple open reading frames. For example, a trivalent vector may 
comprise a gag-pol-nef fusion, in either a E3(-) or E3(+) background, preferably a E3 
deleted backbone, or possibly a "2+1" divalent vaccine, such as a gag-pol fusion (i.e., 
codon optimized p55 gag and inactivated optimized pol; Example 29 and Table 25) 

30 within the same MRKAd5 backbone, with each open reading frame being operatively 
linked to a distinct promoter and transcription termination sequence. Alternatively, the 
two open reading frames may be operatively linked to a single promoter, with the 
open reading frames operatively linked by an internal ribosome entry sequence 
(IRES). Therefore, a multivalent vaccine delivered as a single, or possibly a second 

35 harvested recombinant, replication-deficient adenovirus is contemplated as part of the 
present invention. 
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Therefore, the adenoviral vaccines and plasmid DNA vaccines of this 
invention may be administered alone, or may be part of a prime and boost 
administration regimen. A mixed modality priming and booster inoculation scheme 
will result in an enhanced immune response, particularly if pre-existing anti- vector 

5 immune responses are present. This one aspect of this invention is a method of 
priming a subject with the plasmid vaccine by administering the plasmid vaccine at 
least one time, allowing a predetermined length of time to pass, and then boosting by 
administering the adenoviral vaccine. Multiple primings typically, 1-4, are usually 
employed, although more may be used. The length of time between priming and 

10 boost may typically vary from about four months to a year, but other time frames may 
be used. In experiments with rhesus monkeys, the animals were primed four times 
with plasmid vaccines, then were boosted 4 months later with the adenoviral vaccine. 
Their cellular immune response was notably higher than that of animals which had 
only received adenoviral vaccine. The use of a priming regimen may be particularly 

15 preferred in situations where a person has a pre-existing anti-adenovirus immune 
response. ' 

It is an object of the present invention to provide for enhanced 
replication-defective recombinant adenoviral vaccine vector backbones. These 
recombinant adenoviral backbones may accept one or more transgenes, which may be 

20 passaged through cell culture for growth, amplification and harvest. 

It is a further object to provide for enhanced replication-defective recombinant 
adenoviral vaccine vectors which encode various transgenes. 

It is also an object of the present invention to provide for a harvested 
recombinant, replication-deficient adenovirus which shows enhanced growth and 

25 amplification rates while in combination with increased virus stability after 

continuous passage in cell culture. Such a recombinant adenovims is particularly 
suited for use in gene therapy and nucleotide-based vaccine vectors which, favorably, 
lends itself to large scale propagation. 

To this end, it is an object of the present invention to provide for (1) enhanced 

30 replication-defective recombinant adenoviral vaccine vectors as described herein 
. which encode various forms of HIV-1 Gag, HTV-l Pol, and/or HTV-l Nef, including 
immunologically relevant modifications of HIV-1 Gag, HIV-1 Pol and HTV-l Nef, 
and (2) harvested, purified recombinant repUcation-deficient adenovirus generated by 
passage of the adenoviral vectors of (1) through one or multiple passages through ceU 

35 culture, including but not limited to passage through 293 cells or PER.C6® cells. 
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It is also an object of the present invention to provide for recombinant 
adenovirus harvested by one or multiple passages through cell culture. As relating to 
recombinant adenoviral vaccine vector, this recombinant virus is harvested and 
formulated for subsequent host administration. 
5 It is also an object of the present invention to provide for replication-defective 

adenoviral vectors wherein at least one gene is inserted in the form of a gene 
expression cassette comprising (a) a nucleic acid encoding a protein or biologically 
active and/or Ltnmunologically relevant portion thereof; (b) a heterologous promoter 
operatively linked to the nucleic acid of part a); and, (c) a transcription temainator. 

10 It is also an object of the present invention to provide for a host cell 

comprising said adenoviral vectors and/or said shuttle plasmid vectors; vaccine 
compositions comprising said vectors; and methods of producing the vectors 
comprising (a) introducing the adenoviral vector into a host cell which expresses 
adenoviral El protein, and (b) harvesting the resultant adenoviral vectors. 

15 It is a further object of the present invention to provide for methods of generating a 
cellular inmiune response against a protein in an individual comprising administering 
tO the individual an adenovirus vaccine vector comprising a) a replication defective 
adenoviral vector, at least partially deleted in El, comprising a wildtype adenovirus 
cw-acting packaging region from about base pair 1 to between from about baset pair 

20 342 (more preferably, 400) to about 450 (preferably, 1-450) and, preferably, 3511- 
3523 of a wildtype adenovirus sequence, and, b) a gene expression cassette 
comprising: (i) a nucleic acid encoding a protein or biologically active and/or 
immunologically relevant portion thereof; (ii) a heterologous promoter operatively 
linked to the nucleic acid of part a); and (iii) a transcription terminator and/or a 

25 polyadenylation site. 

It is also an object of the present invention to provide various alternatives for 
vaccine adnMnistration regimes, namely administration of one or more adenoviral 
and/or DNA plasmid vaccines described herein to provide effective 
immunoprophylaxis for uninfected individuals or a therapeutic treatment for HIV 

30 infected patients. Such processes include but are not limited to multivalent HIV-1 
vaccine compositions, various combined mo.dality regimes as well as. various . 
prime/boost alternatives. These methods of administration, relating to vaccine 
composition and/or scheduled administration, will increase the probability of eliciting 
an even more potent cellular immune response when compared to inoculation with a 

35 single modality regimen. 
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As used throughout the specification and claims, the following definitions and 
abbreviations are used: 

"HAART" refers to - highly active antiretroviral therapy 

"first generation" vectors are characterized as being replication-defective. 
5 They typically have a deleted or inactivated El gene region, and preferably have a 
deleted or inactivated E3 gene region as well. 

"AEX" refers to Anion Exchange chromatography. 

"QPA" refers to Quick PCR-based Potency Assay. 

"bps" refers to basepairs. 
10 "s" or "str" denotes that the transgene is in the El parallel or "straight" 

orientation. 

"PBMCs" refers to peripheral blood monocyte cells. 
"FL" refers to full length. 

"FLgag" refers to a full-length optimized gag gene, as shown in Figure 2. 
15 "Ad5-Hgag" refers to an adenovirus serotype 5 replication deficient virus 

which carries an expression cassette which comprises a full length optimized gag gene 
under the control of a CMV promoter. 

"Promoter" means a recognition site on a DNA strand to which an RNA 
polymerase binds. The promoter forms an initiation complex with RNA polymerase 
20 to initiate and drive transcriptional activity. The complex can be modified by 
activating sequences such as enhancers or inhibiting sequences such as silencers. 

"Leader" means a DNA sequence at the 5' end of a structural gene which is 
transcribed along with the gene. This usually results a protein having an N-terminal 
peptide extension, often referred to as a pro-sequences. 
25 "Intron" means a section of DNA occurring in the middle of a gene which 

does not code for an amino acid in the gene product. The precursor RNA of the intron 
is excised and is therefore not transcribed into mRNA not translated into protein. 

"Immunologically relevant" or "biologically active" means (1) with regards to 
a viral protein, that the protein is capable, upon administration, of eliciting a 
30 measurable inraiune response within an individual sufficient to retard the propagation 
and/or spread of the virus and/qr to reduce the viral load present within the individual; 
or (2) with regards to a nucleotide sequence, that the sequence is capable of encoding 
for a protein capable of the above. 

"Cassette" refers to a nucleic acid sequence which is to be expressed, along 
35 with its transcription and translation^ control sequences. By changing the cassette, a 
vector can express a different sequence. 
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"bGHpA" refers to the bovine growth hormone transcription 
terminator/polyadenylation sequence, 

"tPAgag" refers to a fusion between the leader sequence of the tissue 
plasminogen activator leader sequence and an optimized HTV gag gene, as 
5 exemplified in Figure 30A-B, whether in a DNA or adeno virus-based vaccine vector. 

Where utilized, "lA" or "inact" refers to an inactivated version of a gene (e.g. 

lApol). 

"MCS" is "multiple cloning site". 

In general, adenoviral constructs, gene constructs are named by reference to 

10 the genes contained therein. For example: 

**Ad5 mV-l gag", also referred to as the original HIV-1 gag adenoviral 
vector, is a vector containing a transgene cassette composed of a hCMV intron A 
promoter, the full length version of the human codon-optimized HTV-l gag gene, and 
the bovine growth hormone polyadenylation signal. The transgene was inserted in the 

15 El antiparallel orientation in an El and E3 deleted adenovector. 

"MRK Ad5 fflV-1 gag" also referred to as "MRKAdSgag" or "Ad5gag2" is 
an adenoviral vector taught herein which is deleted of El, comprises basepairs 1-450 
and 351 1-3523, and has a human codon-optimized HIV-1 gene in an El parallel 
orientation under the control of a CMV promoter without intron A. The construct 

20 also comprises a bovine growth hormone polyadenylation signal. 

"pVl JnsHIVgag", also referred to as "HIVFLgagPR990r', is a plasmid 
comprising the CMV inunediate-early (IE) promoter and intron A, a full-length 
codon-optimized HTV gag gene, a bovine growth hormone-derived polyadenylation 
and transcriptional termination sequence, and a noinimal pUC backbone. 

25 "pVl JnsCMV(no intron)-FLgag-bGHpA" is a plasmid derived from 

pVl JnsHIVgag which is deleted of the intron A portion of CMV and which comprises 
the full length HTV gag gene. This plasmid is also referred to as "pVl JnsHTVgag- 
bGHpA", pVl Jns-hCMV-FL-gag-bGHpA" and "pVl JnsCMV(no intron) + FLgag + 
bOHpA". 

30 "pVlJnsCMV(no intron)-FLgag-SPA" is a plasmid of the same composition 

as pVlJnsCMV(no intron)-FLgag-bGHpA except that the SPA termination sequence 
replaces that of bGHpA. This plasmid is also referred to as "pVl Jns-HIVgag-SPA" 
and pVl Jns-hCMV-FLgag-SPA". 

*'pdeIElsplA" is a universal shuttle vector with no expression cassette (i.e., no 

35 promoter or polyA). The vector comprises wildtype adenovirus serotype 5 (Ad5) 
sequences from bp 1 to bp 341 and bp 3524 to bp 5798, and has a multiple cloning 
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site between the Ad5 sequences ending 341 bp and beginning 3524 bp. This plasmid 
is also referred to as the original Ad 5 shuttle vector. 
"MRKpdelElsplA" or "MRKpdelEl(Pac/pIX/pack450)" or 

*'MRKpdelEl(Pac/pIX/pack450)Clar' is a universal shuttle vector with no expression 
5 cassette (i.e. no promoter or poly A) comprising wildtype adenovirus serotype 5 (Ad5) 
sequences from bpl to bp450 and bp 351 1 to bp 5798. The vector has a multiple 
cloning site between the Ad5 sequence ending 450 bp and beginning 3511 bp* This 
shuttle vector may be used to insert the CMV promoter and the bGHpA fragments in 
both the straight ("str". or El parallel) orientation or in the opposite (opp. or El 

10 antiparallel) orientation) 

"MRKpdelEl(Pac/pIX/pack450)4<:MVmin+BGH^ is still another 

shuttle vector which is the modified vector that contains the CMV promoter (no 
intronA) and the bGHpA fragments. The expression unit containing the hCMV 
promoter (no intron A) and the bovine growth hormone polyadenylation signal has 

15 been inserted into the shuttle vector such that insertion of the gene of choice at a 

unique BgllL site will ensure the direction of transcription of the transgene will be Ad5 
^ l parallel when inserted into the MRKpAd5(El/E3+)Clal pre-plasmid. This shuttle 
vector, as shown in Figures 22 and 23, was used to insert the respective lApol and 
G2A,LLAA nef genes directly into. 

20 "MRKpdelEl-CMV(no intron)-FLgag-bGHpA" is a shuttle comprising Ad5 

sequences from basepairs 1-450 and 3511-5798, with an expression cassette 
containing hunnian CMV without intron A, the full-length human codon-optimized 
HIV gag gene and bovine growth hormone polyadenylation signal. This plasmid is 
also referred to as 'TMRKpdelEl shuttle +hCMV-FL-gag-BGHpA" 

25 "MRKpAdHVE3+CMV(no intron)-FLgag-bGHpA" is an adenoviral vector 

comprising all Ad5 sequences except those nucleotides encompassing the El region 
(from 451-3510), a human CMV promoter without intron A, a full-length human 
codon-optimized HIV gag gene, and a bovine growth hormone polyadenylation 
signal. This vector is also referred to as "MRKpAdHVE3 + hCMV-FL-gag- 

30 BGHpA", "MRKpAd5HIV-lgag", *Ty[RKpAd5gag". "pMRKAd5gag" or 
"pAd5gag2'.\ 

"pVl Jns-HTV-pol inact(opt)" or "pVl Jns-HIV lA pol (opt) is the inactivated 
Pol gene (contained within SEQ ID NO:3) cloned into the BglH site of VlJns (Figure 
17 A-C). As noted herein, various derivatives of HIV-1 pol may be cloned into a 
35 plasmid expression vector such as VI Jns or VI Jns-tPA, thus serving directly as DNA 
vaccine candidates or as a source for subcloning into an appropriate adenoviral vector. 
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"MRKpdel+hCMVimn+FL-pol+bGHpA(s)" is the 
"MRKpdelElCPac/pIX/pack450)+CMVmin+BGHpA(str.)" shuttle mentioned above 
which contains the lA pol gene is the proper orientation. This shuttle vector is used in 
a bacterial recombination with MRKpAd(El-/E3+)Clal. 
5 "MRKpAd+h(:MVmin+Ii:.-pol+bGHpA(S)E3+", also referred to herein as 

"pMRKAdSpol", is the pre-adenovirus plasmid which comprises a CMV-pol 
inact(opt)-pGHpA construct. The construction of this pre-adenovirus plasmid is 
shown in Figure 22. 

"pVl Jns/nef (G2A,LLAA)" or "VlJns/opt nef (G2A,LLAA)" comprises 

10 codon optimized HIV-1 Nef wherein the open reading frame codes for modifications 
at the amino terminal rayristylation site (Gly-2 to Ala-2) and substitution of the Leu- 
174-Leu-175 dileucine motif to Ala-174-AIa-175 (SEQ ID NO:13; which comprises 
an initiating methionine residue at nucleotides 12-14 and a *TAA" stop codon from 
nucleotides 660-662). This fragment is subcloned into the Bgl II site of VI Jns 

15 and/orVl Jns-tPA (Figures 16A-B). As noted above for HIV-1 pol, HIV-1 nef 

constructs may be cloned into a plasmid expression vector such as VI Jns or VI Jns- 
tPA, thus serving directly as DNA vaccine candidates or as a source for subcloning 
into an appropriate adenoviral vector. 

"MRKpdelElhCMVminFL-nefBGHpA(s)", also referred to herein as 

20 "pMRKAd5nef is the pre-adenovirus plasmid which comprises a CMV-nef 

(G2A,LLAA) codon optimized sequence. The construction of this pre-adenovirus 
plasmid is shown in Figure 23. 

BRIEF DESCRIPTION OF THE HGURES 
25 Figure 1 shows the original HTV-l gag adenovector (Ad5HIV-lgag). This 

vector is disclosed in PCT fiitemational Application No. PCT/USOO/18332 (WO 
01/02607) filed July 3, 2000, claiming priority to U.S. Provisional Application Serial 
No. 60/142,631, filed July 6, 1999 and U.S. Application Serial No. 60/148,981, filed 
August 13, 1999, all three applications which are hereby incorporated by reference. 
30 Figure 2 shows the nucleic acid sequence (SEQ ID NO: 29) of the optimized 

. human HIV-1 gag open reading frame. . . 

Figure 3 shows diagrammatically the new transgene constructs in comparison 
with the original gag transgene. 

Figure 4 shows the modifications made to the original adenovector backbone 
35 in the generation of the novel vectors of the instant invention. 
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Figure 5 shows the virus mixing experiments that were carried out to 
determine the effects of the addition made to the packaging signal region (Bxpt. #1) 
and the E3 gene on viral growth (Expt. #2). The bars denote the region of 
modifications made to the El deletion. 

5 Figure 6 shows an autoradiograph of viral DNA analysis following the viral 

mixing experiments described in Examples 6 and 7. 

Figures 7 A, 7B and 7C are as follows: Figure 7 A shows the hCMV-Flgag- 
bGHpA adenovectors constructed within the MRKpAdHVE3 and MRKpAdHVO 
adenovector backbones. Both El parallel and El antiparallel transgene orientation are 

10 represented. Figure 7B shows the hCMV-Flgag-SPA adenovectors constructed within 
the MRKpAdHVE3 and MRKpAdHVO adenovector backbones. Again, both El 
parallel and El antiparallel transgene orientation are represented. Figure 7C shows 
the mCMV-Flgag-bGHp A adenovectors constructed within the MRKpAdHVES and 
MRKpAdHVO adenovector backbones. Once again, both El parallel and El 

15 antiparallel transgene orientation are represented. 

Figure 8A shows the experiment designed to test the effect of transgene 
orientation. 

Figure 8B shows the experiments designed to test the effect of 
polyadenylation signal. 
20 Figure 9 shows viral DNA from the four adenoviral vectors tested (Example 

12) at P5, following BstBl 1 digestion. 

Figure 10 shows viral DNA analysis of passages 11 and 12 of 
MRKpAdHVES, MRKAdSHIV-lgag, andMRKAdSHIV-lgagES-. 

Figure 11 shows viral DNA analysis (HindSn digestion) of passage 6 
25 MEUCpAdHVEB and MRKAd5HIV-lgag used to initiate the viral competition study. 
The last two lanes are passage 1 1 analysis of duplicate passages of the competition 
study (each virus at MOI of 280 viral particles). 

Figure 12 shows viral DNA analysis by Hind HI digestion on high passage 
numbers for MRKAd5HIV-lgag in serum-containing media with collections made at 
30 specified times. The first lane shows the Ikb DNA size marker. The other lanes 

represent pre-plasmid control (digested with Pad and Hindni), MRKAd5HIV-lgag 
atP16,P19, andP21. 

Figure 13 shows serum anti-p24 levels at 3 wks post i.m. immunization of 
balb/c mice (n=10) with varying doses of several Adgag constructs: (A) MRK Ad5 
35 HTV-l gag (through passage 5); (B) MRKAd5 hCMV-ELgag-bGHpA (E3-); (C) 
MRKAdS hCMV-FLgag-SPA (E3+); (D) MRKAdS mCMV-FLgag-bGHpA (E3+); 



18 



wo 02/22080 PCTAJSOl/28861 



(E) research lot (293 cell-derived) of Ad5HIV-l gag; and (F) clinical lot 
(AdSgagFNOOOl) of Ad5HIV-l gag. Reported are the geometric mean titers (GMT) 
for each cohort along with the standard error bars. 

Figure 14 shows a restriction map of the pMRKAdSIDV-lgag vector. 
5 Figures 15A-X illustrates the nucleotide sequence of the pMRKAdSHIV-lgag 

vector (SEQ ID NO:27.[coding] and SEQ ID NO:28 [non-coding]). 

Figures 16A-B shows a schematic representation of DNA vaccine expression 
vectors VI Jns (A) and VI Jns-tPA (B), which are utilized for HTV-l gag, pel and nef 
constructs in various DNA/viral vector combined modality regimens as disclosed 
10 herein. 

Figures 17A-C shows the nucleotide (SEQ ID N0:3) and amino acid sequence 
(SEQ ID N0:4) of lA-Pol. Underlined codons and amino acids denote mutations, as 
listed in Table 1. 

Figure 18 shows codon optimized nucleotide and amino acid sequences 
15 through the fusion junction of tPA-pol inact(opt) (contained within SEQ ID NOs: 7 
and 8, respectively). The underlined portion represents the NHa-tenninal region of 
lA-Pol. 

Figures 19A-B show a nucleotide sequence comparison between wild type 
nef(jrfl) and codon optimized nef. The wild type nef gene from the jifl isolate 
20 consists of 648 nucleotides capable of encoding a 216 amino acid polypeptide. WT, 
wild type sequence (SEQ ED NO: 19); opt, codon-optimized sequence (contained 
within SEQ ID NO:l). The Nef amino acid sequence is shown in one-letter code 
(SEQIDN0:2). 

Figures 20A-C show nucleotide sequences at junctions between nef coding 
25 sequence and plasmid backbone of nef expression vectors VI Jns/nef (Figure 20A), 

VlJns/nef(G2A,LLAA) (Figure 20B), VlJns/tpanef (Figure 20C) and 

VlJns/tpanef(LLAA) (Figure 20C, also). 5' and 3' flanking sequences of codon 

optimized nef or codon optimized nef mutant genes are indicated by bold/italic letters; 

nef and nef mutant coding sequences are indicated by plain letters. Also indicated (as 
30 underlined) are the restriction endonuclease sites involved in construction of 

respective nef expression vectors. VI Jns/tpanef and VI Jns/tpanef(LLAA) have 

identical sequences at the junctions. 

Figure 21 shows a schematic presentation of nef and nef derivatives, Amino 

acid residues involved in Nef derivatives are presented. Glycine 2 and Leucinel74 
35 and 175 are the sites involved in myristylation and dileucine motif, respectively. For 

both versions of the tpanef fusion genes, the putative leader peptide cleavage sites are 
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indicated with and a exogenous serine residue introduced during the construction 
of the mutants is underlined 

Figure 22 shows diagrammatically the construction of the pre-adenovirus 
plasmid construct, MRKAdSPol. 
5 Figure 23 shows diagrammatically the construction of the pre-adenovirus 

plasmid construct, MRKAdSNef . 

Figure 24 shows a comparison of clade B vs. clade C anti-gag T cell responses 
in clade B HIV-infected subjects. 

Figure 25 shows a comparison of clade B vs. clade C anti-nef T cell responses 
10 in clade B HIV-infected subjects. 

Figures 26A-AO illustrates the nucleotide sequence of the pMRKAdSHIV- 
Ipol adenoviral vector (SEQ ID NO:32 [coding] and SEQ ID NO:33 [non-coding]), 
comprising the coding region of the inactivated pol gene (SEQ ID N03). 

Figures 27A-AM illustrates the nucleotide sequence of the pMRKAdSHTV-l 
15 nef adenoviral vector (SEQ ID NO:34 [coding] and SEQ ID NO:35 [non-coding]), 
comprising the coding region of the inactivated pol gene (SEQ ID N013). 

Figure 28 shows the stability of MRKAdS vectors comprising various 
prornoter fragments (hCMV or mCMV) and terminations signals (bGH or SPA) in 
E3(+) or E3(") backbones. 
20 Figures 29A and B shows the anion-exchange HPLC viral particle 

concentrations of the freeze-thaw recovered cell associated virus at the 24, 36, 48, and 
60 hpi time points (Figure 29 A) and the timcourse QPA supernatant titers (Figure 
29B) for MRKAd5gag, MRKAd5pol and MRKAd5nef. 

Figure 30 shows the nucleotide sequence (SEQ ID NO:36) and amino acid 
25 sequence (SEQ ID NO:37) comprising the open reading frame of a representative 
tPA-gag fusion for use in the DNA and/or adenoviral vaccine disclosed herein. 

Figure 31 shows the intracellular ylFN staining of PBMCs collected at week 
10 (post DNA prime) and week 30 (post Ad boost). The cells were stimulated 
overnight in the presence or absence of the gag peptide pool. They were subsequently 
30 stained using fluorescence-tagged anti-CD3, anti-ODS, anti-CD4, and anti-ylEW 

monoclonal antibodies. Each plot shows all CD3+ T cells which were segregated in 
terms of positive staining for surface CD8 and ylFN production. The numbers in the 
upper right and lower right quadrants of each plot are the percentages of CD3^ cells 
that were CDS'^yDFN^ ™d CD4'*yIFK*", respectively. 
35 Figure 32 shows a comparison of single-modality adenovirus immunization 

with DNA + adjuvant prime/adenovims boost immunization. 
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Figures 33A-B show the nucleotide sequence (SEQ ID NO: 38) of the open 
reading frame for the gag-IApol fusion of Example 29. 

Figures 34A-B show the protein sequence (SEQ ID NO:39) of the gag-IApol 
fustion frame. 

5 

DETAILED DESCRIPTION OF IHE INVENTION 

A novel replication-defective, or "first generation," adenoviral vector suitable 
for use in gene therapy or nucleotide-based vaccine vectors is described. This vector 
is at least partially deleted in El and comprises a wildtype adenovirus c/.s-acting 

10 packaging region from about base pair 1 to between about base pair 342 (more 
preferably, 400) to about 458 (preferably, 1-450) and, preferably, 3511-3523 of a 
wild-type adenovirus sequence. It has been found that a vector of this description 
possesses enhanced growth characteristics, with approximately 5-10 fold greater 
amplification rates, and is more potent allowing lower doses of virus to be used to 

15 generate equivalent immunity. The vector, furthermore, generates a harvested 

recombinant adenovirus which shows greater cellular-mediated immune responses 
*^han replication-deficient vectors not comprising this region (basepairs 342-450). 
Adenoviral constructs derived from these vectors are, further, very stable genetically, 
particularly those comprising a transgene under the control of a hCMV promoter 

20 devoid of intron A. Viruses in accordance with this description were passaged 

continually and analyzed; see Example 12. Each virus analyzed maintained it correct 
genetic structure. Analysis was also carried out under propagation conditions similar 
to that performed in large scale production. Again, the vectors were found to possess 
enhanced genetic stability; see Figure 12. Following 21 passages, the viral DNA 

25 showed no evidence of rearrangement, and was highly reproducible from one 
production lot to the next. The outcome of all relevant tests indicate that the 
adenoviral vector is extremely well suited for large-scale production of recombinant, 
replication-deficient adenovirus, as shown herein with the data associated with Figure 
28. 

30 A preferred adenoviral vector in accordance with this description is a vector 

comprising basepairs 1-450, which is deleted in E3. This vector can accommodate up 
to approximately 7,500 base pairs of foreign DNA inserts (or exogenous genetic 
material). Another preferred vector is one retaining E3 which comprises basepairs 1- 
450. A prefeiTed vector of this description is an E3+ vector comprising basepairs 1- 

35 450 and 351 1-3523. This vector, when deleted of the region spanning basepairs 451- 
3510, can acconmiodate up to approximately, 4,850 base pairs of foreign DNA inserts 
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(or exogenous genetic material). The cloning capacities of the above vectors have 
been determined using 105% of the wildtype Ad5 sequence as the upper genome size 
limit. 

Wildtype adenovirus serotype 5 is used as the basis for the specific basepair 

5 numbers provided throughout the specification. The wildtype adenovirus serotype 5 
sequence is known and described in the art; see^ Chroboczek et al, 1992 /. Virology 
186:280, which is hereby incorporated by reference. Accordingly, a particular 
embodiment of the instant invention is a vector based on the adenovirus serotype 5 
sequence. One of skill in the art can readily identify the above regions in other 

10 adenovirus serotypes (e.g., serotypes 2, 4, 6, 12, 16, 17, 24, 31, 33, and 42), regions 
defined by basepairs corresponding to the above basepair positions given for 
adenovirus serotype 5. Accordingly, the instant invention encompasses all adenoviral 
vectors partially deleted in El comprising basepairs corresponding to 1-450 
(particularly, 342-450) and, preferably, 3511-3523 of a wild-type adenovirus serotype 

15 5 (Ad5) nucleic acid sequence. Particularly preferred embodiments of the instant 
invention are those derived from adenoviruses like Ad5 which are classified in 
"•ubgroup C (e.g., Ad2). 

Vectors in accordance with the instant invention are at least partially deleted in 
El. Preferably the El region is completely deleted or inactivated. Most preferably, 

20 the region deleted of El is within basepairs 451-3510. It is to be noted that tiie 

extended 5' and 3' regions of the disclosed vectors are believed to effectively reduce 
the size of the El deletion of previous constructs without overlapping any part of the 
ElAyElB gene present in the cell line used, i.e., the PER,C6® cell line transefected 
with base pairs 459-3510. Overlap of adenoviral sequences is avoided because of the 

25 possibility of recombination. One of ordinary skill in the art can certainly appreciate 
that the instant invention can, therefore, be modified if a different cell line transfected 
with a different segment of adenovirus DNA is utilized. For purposes of 
exemplification, a 5' region of base pairs 1 to up to 449 is more appropriate if a cell 
line is transfected with adenoviral sequence from base pairs 450-3510. This holds 

30 true as well in the consideration of segments 3' to the El deletion. 

Preferred embodiments of the instant invention possess an intact E3 region 
(i.e., an E3 gene capable of encoding a functional E3). Alternate embodiments have a 
partially deleted E3, an inactivated E3 region, or a sequence completely deleted of E3, 
Applicants have found, in accordance with the instant invention, that virus comprising 

35 the E3 gene were able to amplify more rapidly compared with virus not comprising an 
E3 gene; see Figure 6 wherein a diagnostic CsCl band corresponding to the E3+ virus 
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tested (5,665 bp) was present in greater amount compared with the diagnostic band of 
3,010 bp corresponding to the E3- virus. These results were obtained following a 
virus competition study involving mixing equal MOI ratio (1:1) of adenovectors both 
comprising the E3 gene and not comprising the E3 gene. This increased amplification 
5 capacity of the E3+ adenovectors was subsequently confirmed with growth studies; 
see Table 4A, wherein the E3+ virus exhibit amphfication ratios of 470, 420 and 320 
as compared with the 1 15 and 40-50 of the E3- constructs. 

As stated above, vectors in accordance with the instant invention can 
accommodate up to approximately 4,850 base pairs of exogenous genetic material for 

10 an E3+ vector and approximately 7,500 base pairs for an E3- vector. Preferably, the 
insert brings the adenoviral vector as close as possible to a wild-type genomic size 
(e.g., for Ad5, 35,935 basepairs). It is well known that adenovirus amplifies best 
when they are close to their wild-type genomic size. 

The genetic material can be inserted in an El-parallel or an El anti-parallel 

15 orientation, as such is illustrated in Figure 7 A, 7B, 7C and Figure 8 A. Particularly 
preferred embodiments of the instant invention, have the insert in an El-parallel 
"^>ntation. Applicants have found, via competition experiments with plasmids 
containing transgenes in differing orientation (Figure 8A), that vector constructs with 
the foreign DNA insert in an El -parallel orientation amplify better and actually out- 

20 compete El-antiparallel-oriented transgenes. Viral DNA analysis of the mixtures at 
passage 3 and certainly at passage 6, showed a greater ratio of the virus carrying the 
transgene in the El parallel orientation as compared with the El anti-parallel version. 
By passage 10, the only viral species observed was the adenovector with the transgene 
in the El parallel orientation for both transgenes tested. 

25 Adenoviral vectors in accordance with the instant invention are particularly 

well suited to effectuate expression of desired proteins, one example of which is an 
HIV protein, particularly an HIV full length gag protein. Exogenous genetic material 
encoding a protein of interest can exist in the form of an expression cassette. A gene 
expression cassette preferably comprises (a) a nucleic acid encoding a protein of 

30 interest; (b) a heterologous promoter operatively linked to the nucleic acid encoding 
the protein; and (c) a transcription terminator. 

The transcriptional promoter is preferably recognized by an eukaryotic RNA 
polymerase. In a preferred embodiment, the promoter is a "strong" or "efficient" 
promoter. An example of a strong promoter is the immediate early human 

35 cytomegalovirus promoter (Chapman et al, 1991 Nucl Acids J?e5l9:3979-3986, which 
is incorporated by reference), preferably without intronic sequences. Most preferred 
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for use within the instant adenoviral vector is a human CMV promoter without 
intronic seqeunces, like intron A. Applicants have found that intron A, a portion of 
the human cytomegalovirus promoter (hCMV), constitutes a region of instability for 
adenoviral vectors. CMV without intron A has been found to effectuate (Examples 1- 

5 3) comparable expression capabilities in vitro when driving HIV gag expression and, 
furthermore, behaved equivalently to intron A-containing constructs in Balb/c mice in 
vivo with respect to their antibody and T-cell responses at both dosages of plasmid 
DNA tested (20 \ig and 200 jig). Those skilled in die art will appreciate that any of a 
number of other known promoters, such as the strong immunoglobulin, or other 

10 eukaryotic gene promoters may also be used, including the EFl alpha promoter, the 
murine CMV promoter, Rous sarcoma virus (RS V) promoter, S V40 early/late 
promoters and the beta-actin promoter. 

In preferred embodiments, the promoter may also comprise a regulatable 
sequence such as the Tet operator sequence. This would be extremely useful, for 

15 example, in cases where the gene products are effecting a result other than that 
desired and repression is sought. 

Preferred transcription termination sequences present within the gene 
expression cassette are the bovine growth hormone terminator/polyadenylation signal 
(bGHpA) and the short synthetic polyA signal (SPA) of 50 nucleotides in length, 

20 defined as follows: AATAAAAGATCTTTATTTTCATTAGATCTGTGTGTTGGT- 
TTTTTGTGTG (SEQ ED NO:26). 

The combination of the CMV promoter (devoid of the intron A region) with 
the BGH terminator is particularly preferred although other promoter/terminator 
combinations in the context of FG adenovirus may also be used. 

25 Other embodiments incorporate a leader or signal peptide into the transgene. 

A preferred leader is that from the tissue-specific plasminogen activator protein, tPA. 
Examples include. but are not limited to the various tPA-gag, tPA-pol and tPA-nef 
adenovirus-based vaccines disclosed throughout this specification. 

In view of the improved adenovirus vectors described herein, an essential 

30 portion of the present invention are adenoviral-based HIV vaccines comprising said 
adenovirus backbones which may be administered to a mammalian host, preferably a 
human host, in either a prophylactic or therapeutic setting. The HIV vaccines of the 
present invention, whether administered alone or in combination regimens with other 
vural- or non- viral-based DNA vaccines, should elicit potent and broad cellular 

35 immune responses against HIV that will either lessen the likelihood of persistent virus 
infection and/or lead to the establishment of a clinically significant lowered virus load 
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subject to HIV infection or in combination with HAART therapy, mitigate the effects 
of previously established HTV infection (antiviral immunotherapy(ARI)). While any 
fflV antigen (e.g., gag, pol, nef, gpl60, gp41, gpl20, tat, rev, etc.) may be utilized in 
the herein described recombinant adenoviral vectors, preferred embodiments include 
5 the codon optimized p55 gag antigen (herein exemplified as MRKAdSgag), pol and 
nef. Sequences based on different Clades of HTV-l are suitable for use in the instant 
invention, most preferred of which are Clade B and Clade C. Particularly preferred 
embodiments are those sequences (especially, codon-optimized sequences) based on 
concensus Clade B sequences. Preferred versions of the MRKAd5pol and 

10 MRKAdSnef series of adenoviral vaccines will encode modified versions of pol or 
nef, as discussed herein. Preferred embodiments of the MRKAd5HIV-l vectors 
carrying HIV envelope genes and modifications thereof comprise the HTV codon- 
optimized env sequences of PCX Intemational Applications PCTAJS97/02294 and 
PCT/US97/10517. published August 28, 1997 (WO 97/31115) and December 24, 

15 1997, respectively; both documents of which are hereby incorporated by reference. 

A most preferred aspect of the instant invention is the disclosed use of the 
adenoviral vector described above to effectuate expression of HIV gag. Sequences 
for many genes of many HIV strains are publicly available in GENBANK and 
primary, field isolates of HIV are available from the National Institute of Allergy and 

20 Infectious Diseases (NIAID) which has contracted with Quality Biological 

(Gaithersburg, MD) to make these strains available. Strains are also available from 
the World Health Organization (WHO), Geneva Switzerland. It is preferred that the 
gag gene be from an HTV-l strain (CAM-1; Myers et al, eds. "Human Retroviruses 
and AIDS: 1995, IIA3-IIA19, which is hereby incorporated by reference). This gene 

25 closely resembles the consensus amino acid sequence for the clade B (North 

American/European) sequence. Therefore, it is within the purview of the skilled 
artisan to choose an appropriate nucleotide sequence which encodes a specific HIV 
gag antigen, or inununologically relevant portion thereof. As shown in Example 25, a 
clade B or clade C based p55 gag antigen will potentially be useful on a global scale. 

30 As noted herein, the transgene of choice for insertion in to a DNA or MRKAd-based 
adenoviral vector of the present invention is a codon optimized version of p55 gag. 
Such a MRKAdSgag adenoviral vector is documented in Example 11 and is at least 
referred to herein as MRKAd5HIV-lgag. Of course, additional versions are 
contemplated, including but not limited to modifications such as promoter (e.g., 

35 mCMV for hCMV) and/or pA-terminations signal (SPA for bGH) switching, as well 
as generating MRK Ad5 backbones with or without deletion of the Ad5 E3 gene. 

25 
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The present invention also relates a series of MKKAdSpol-based adenoviral 
vaccines which are shown herein to generate cellular inrmiune responses subsequent to 
administration in mice and non-human primate studies. Several of the MRKAdSpol 
series are exemplified herein. One such adenoviral vector is referred to as 

5 MRKAdShCMV-inact opt pol(E3+), which comprises the MRKAd5 backbone, the 
hCMV promoter (no intron A), an inactivated pol transgene, and contains the Ad5 E3 
gene in the adenoviral backbone. A second exemplified pre-adenovirus plasmid and 
concomitant virus is referred to as MRKAd5hCMV-inact opt pol(E3-), which is 
identical to the fonner adenoviral vector except that the E3 is deleted. Both 

10 constructions contain a codon optimized, inactivated version of HIV-1 Pol, wherein at 
least the entire coding region is disclosed herein as SEQ ID NO:3 and the expressed 
protein is shown as SEQ ID N0:4 (see also Figure 17A-C and Table 1, which show 
targeted deletion for inactivated pol. This and other preferred codon optimized 
versions of HDTV Pol as disclosed herein are essentially as described in U.S. 

15 Application Serial No. 09/745,221, filed December 21, 2000 and PCT International 
Application PCTAJSOO/34724, also filed December 21, 2000, both documents which 
are hereby incorporated by reference. As disclosed in the above-mentioned 
documents, the open reading frame for these codon-optimdzed HIV-1 Pol-based DNA 
vaccines are represented by codon optimized DNA molecules encoding codon 

20 optimized HIV-1 Pol (e.g. SEQ ID N0:2), codon optimized fflV-1 Pol fused to an 
amino terminal localized leader sequence (e.g. SEQ ID N0:6), and especially 
preferable, and exemplified by the MRKAd5-Pol construct in e.g., Example 19, 
biologically inactivated pol ("inact opt Pol"; e.g., SEQ ID N0:4) which is devoid of 
significant PR, RT, RNase or IN activity associated with wild type Pol. In addition, a 

25 construct related to SEQ ID NO:4 is contemplated which contains a leader peptide at 
the amino terminal region of the lA Pol protein. A specific construct is ligated within 
an appropriate DNA plasmid vector containing regulatory regions operatively linked 
to the respective HIV-1 Pol coding region, with or without a nucleotide sequence 
encoding a functional leader peptide. To this end, various HIV-1 Pol constructs 

30 disclosed herein relate to open reading frames for cloning to the enhanced first 
generation Ad vectors of the present invention (such a series of MRKAdSpol 
adenoviral vaccine vectors), including but not limited to wild type Pol (comprising the 
DNA molecule encoding WT opt Pol, as set forth in SEQ ID N0:2), tPA-opt WTPol, 
(comprising the DNA molecule encoding tPA Pol, as set forth in SEQ ID NO:6), inact 

35 opt Pol (comprising the DNA molecule encoding lA Pol, as set forth in SEQ ID 

N0:4), and tPA-inact opt Pol, (comprising the DNA molecule encoding tPA-inact opt 
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Pol, as set forth in SEQ ID NO:8). The pol-based versions of enhanced first 
generation adenovirus vaccines elicit CTL and Th cellular immune responses upon 
adn3inistration to the host, including primates and especially humans. As noted in the 
above, an effect of the cellular immune-directed vaccines of the present invention 
5 should be a lower transmission rate to previously uninfected individuals and/or 

reduction in the levels of the viral loads within an infected individual, so as to prolong 
the asymptomatic phase of HTV-I infection. 

The present invention further relates to a series of MRKAd5nef-based 
adenoviral vaccines which, similar to HTV gag and pol antigens, generate cellular 

10 immune responses subsequent to administration in mice and non-human primate 
studies. The MRKAdSnef series are exemplified herein by utilizing the improved 
MRK adenoviral backbone in combination with modified versions of HIV nef . These 
exemplified MRKAdSnef vectors are as follows: (1) MRKAdShCMV- 
nef (G2A,LLAA) (E3+), which comprises the improved MRKAdS backbone, a human 

15 CMV promoter an intact Ad5 E3 gene and a modified nef gene: (2) MRKAdSmCMV- 
nef(G2A,LLAA) (E3+), which is the same as (1) above but substituting a murine 
CMV promoter for a human CMV promoter; and (3) MRKAdSmCMV-tpanef (LLAA) 
(E3+), which is the same as (2) except that the nef transgene is tpanef(LLAA). Codon 
optimized versions of HTV-l Nef and HTV-l Nef modifications are essentially as 

20 described in U.S. Application Serial No. 09/738,782, filed December 15, 2000 and 
PCT International Apphcation PCT/USOO/34162, also filed December 15, 2000, both 
documents which are hereby incorporated by reference. Particular embodiments of 
codon optimized Nef and Nef modifications relate to a DNA molecule encoding HIV- 
1 Nef from the HIV-1 jfrl isolate wherein the codons are optimized for expression in a 

25 mammalian system such as a human. The DNA molecule which encodes this protein 
is disclosed herein as SEQ ID N0:9, while the expressed open reading frame is 
disclosed herein as SEQ ID NO: 10. Another embodiment of Nef-based coding 
regions for use in the adenoviral vectors of the present invention comprise a codon 
optimized DNA molecule encoding a protein containing the human plasminogen 

30 activator (tpa) leader peptide fused with the NH2-temQinus of the HTV-l Nef 

polypeptide. The DNA molecule which encodes this protein is disclosed herein as 
SEQ ID N0:1 1, while the expressed open reading firame is disclosed herein as SEQ 
ID NO: 12. Another modified Nef optimized coding region relates to a DNA molecule 
encoding optimized HIV-1 Nef wherein the open reading frame codes for 

35 modifications at the amino terminal myristylation site (Gly-2 to Ala-2) and 

substitution of the Leu-174-Leu-175 dileucine motif to Ala-174-Ala-175, herein 

27 



wo 02/22080 



PCTAJSOl/28861 



described as opt nef (G2A, LLAA). The DNA molecule which encodes this protein is 
disclosed herein as SEQ ID NO: 13, while the expressed open reading frame is 
disclosed herein as SEQ ID NO: 14. MRKAd5nef vectors (1) MRKAd5hCMV- 
nef(G2A,LLAA) (E3+) and (2) MRKAd5mCMV-nef(G2A,LLAA) (E3+) contain this 
5 transgene. An additional embodiment relates to a DNA molecule encoding optimized 
HIV-1 Nef wherein the amino terminal myristylation site and dileucine motif have 
been deleted, as well as comprising a tPA leader peptide. This DNA molecule, opt 
tpanef (LLAA), comprises an open reading frame which encodes a Nef protein 
containing a tPA leader sequence fused to amino acid residue 6-216 of HTV-l Nef 

10 (jfirl), wherein Leu-174 and Leu-175 are substituted with Ala-174 and Ala-175, herein 
referred to as opt tpanef (LLAA) is disclosed herein as SEQ ID NO: 15, while the 
expressed open reading frame is disclosed herein as SEQ ID NO: 16. The 
MRKAdSnef vector "MRKAdSmCMV-tpanef (LLAA) (E3+)" contains this transgene. 
Along with the improved MRKAdSgag adenovirus vaccine vector described 

15 herein, generation of a MRKAdSpol and MRKAdSnef adenovirus vector provide for 
enhanced HIV vaccine capabilities. Namely, the generation of this trio of adenoviral 
vaccine vectors, all shown to generate effective cellular immune responses subsequent 
to host administration, provide for the ability to administer these vaccine candidates 
not only alone, but preferably as part of a divalent (i.e., gag and nef, gag and pol, or 

20 pol and nef components) or a trivalent vaccine (i.e., gag, pol and nef components). 
Therefore, a preferred aspect of the present invention are vaccine formulations and 
associated methods of administration and concomitant generation of host cellular 
immune responses associated with formulating three separate series of MRKAd5- 
based adenoviral vector vaccines. Of course, this MRKAd5 vaccine series based on 

25 distinct HTV antigens promotes expanded opportunities for formulation of a divalent 
or trivalent vaccine, or possibly administration of separate formulations of one or 
more monovalent or divalent formulations within a reasonable window of time. It is 
also within the scope of the present invention to embark on combined modality 
regimes which include multiple but distinct components from a specific antigen. An 

30 example, but certainly not a limitation, would be separate lVIRKAd5pol vectors, with 
one vaccine vector expressing wild type Pol (SEQ ID N0:2) and another 
MRKAd5pol vector expressing inactivated Pol (SEQ ID N0:6). Another example 
might be separate MElKAdSnef vectors, with one vaccine vector expressing the 
tPA/LLAA version of Nef (SEQ ID NO: 16) and another MRKAdSnef vector 

35 expressing the G2A,LLAA modified version of Nef (SEQ ID NO: 14). Therefore, the 
MRKAdS adenoviral vectors of the present invention may be used in combination 
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with multiple, distinct HIV antigen classes. Each HIV antigen class is subject to 
sequence manipulation, thus providing for a multitude of potential vaccine 
combinations; and such combinations are within the scope of the present invention. 
The utilization of such combined modalities vaccine formulation and administration 
5 increase the probability of eliciting an even more potent cellular immune response 
when compared to inoculation with a single modality regimen. 

The present invention also relates to application of a mono-, dual-, or tri- 
modality administration regime of the MRKAd5gag, pol and nef adenoviral vaccine 
series in a prime/boost vaccination schedule. This prime/boost schedule may include 

10 any reasonable combination of the MRKAdSgag, pol and nef adenoviral vaccine 

series disclosed herein. In addition, a prime/boost regime may also involve other viral 
and/or non-viral DNA vaccines. A preferable addition to an adenoviral vaccine 
vector regime includes but is not limited to plasmid DNA vaccines, especially DNA 
plasmid vaccines that contain at least one of the codon optimized gag, pol and nef 

1 5 constructions, as disclosed herein. 

Therefore, one aspect of this invention is the administration of the adenoviral 
vector containing the optimized gag gene in a prime/boost regiment in conjunction 
with a plasmid DNA encoding gag. To distinguish this plasmid from the adenoviral- 
containing shuttle plasmids used in the construction of an adenovirus vector, this 

20 plasmid will be referred to as a "vaccine plasmid" or "DNA plasmid vaccine". 
Preferred vaccine plasmids for use in this administration protocol are disclosed in 
pending U.S. patent application 09/017,981, filed February 3, 1998 and WO98/34640, 
published August 13, 1998, both of which are hereby incorporated by reference. 
Briefly, the preferred vaccine plasmid is designated VlJns-FLgag, which expresses 

25 the same codon-optimized gag gene as the adenoviral vectors of this invention (see 
Figure 2 for the nucleotide sequence of the exemplified optimized codon version of 
full length p55 gag). The vaccine plasmid backbone, designated VI Jns contains the 
CMV immediate-early (DB) promoter and intron A, a bovine growth hormone-derived 
polyadenylation and transcription termination sequence as the gene expression 

30 regulatory elements, and a minimal pUC backbone; see Montgomery et aL, 1993, 
DNA Cell Biol. 12:777-783. The pUC sequence permits high levels of plasmid 
production in E. coli and has a neomycin resistance gene in place of an ampicillin 
resistance gene to provide selected growth in the presence of kanamycin. 
Alternatively, a vaccine plasmid which has the CMV promoter deleted of intron A can 

35 be used. Those of skill in the art will recognize that alternative vaccine plasmid 
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vectors may be easily substituted for these specific constructs, and this invention 
specifically envisions use of such alternative plasmid DNA vaccine vectors. 

Another aspect of the present invention is a prime/boost regimen which 
includes a vaccine plasmid which encodes an HTV pol antigen, preferably a codon 
5 optimized form of pol and also preferably a vaccine plasmid which comprises a 
nucleotide sequence which encodes a Pol antigen selected from the group of Pol 
antigens as shown in SEQ ID NOs: 2, 4, 6 and 8. The variety of potential DNA 
plasmid vaccines which encode various biologically active forms of HIV-1 Pol, 
wherein administration, intracellular delivery and expression of the HTV-l Pol gene of 

10 interest elicits a host CTL and Th response. The preferred synthetic DNA molecules 
of the present invention encode codon optimized wild type Pol (without Pro activity) 
and various codon optimized inactivated HTV-l Pol proteins. The HIV-1 pol open 
reading disclosed herein are especially preferred for pharmaceutical uses, especially 
for human administration as delivered via a recombinant adenoviral vaccine, 

15 especially an enhanced first generation recombinant adenoviral vaccine as described 
herein. Several embodiments of this portion of the invention are provided in detail 
below, namely DNA molecules which comprise a HIV-1 pol open reading frame, 
whether encoding full length pol or a modification or fusion as described herein, 
wherein the codon usage has been optimized for expression in a mammal, especially a 

20 human. Again, these DNA sequences are positioned appropriately within a 

recombinant adenoviral vector, such as the exemplified recombinant adenoviral vector 
described herein, so as to promote expression of the respective HTV-l Pol gene of 
interest, and subsequent to administration, elicit a host CTL and Th response. Again, 
these preferred, but in no way limiting, pol genes are as disclosed herein and 

25 essentially as described in U.S. Application Serial No. 09/745,221 , filed December 
21, 2000 and PCT International Application PCT/USOO/34724, also filed December 
21, 2000, both documents which are hereby incorporated by reference. 

A third series of vaccine plasmids which are useful in a combined modality 
and/or prime/boost regimen are vaccine plasmids which encode an HIV nef antigen or 

30 biologically and/or immunologically relevant modification thereof. As noted 

elsewhere, preferred vaccine plasmids contain a codon optinoized form of nef and also 
preferably comprise a nucleotide sequence which encodes a Nef antigen selected from 
the group of Nef antigens as shown in SEQ ID NOs: 10, 12, 14 and 16. These 
preferred nef coding regions are disclosed herein, as well as being described in U.S. 

35 Application Serial No. 09/738,782, filed December 15, 2000 and PCT International 
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Application PCT/USOO/34162, also filed December 15, 2000, both documents which 
are hereby incorporated by reference. 

Therefore, the adenoviral vaccines and plasmid DNA vaccines of this 
invention may be administered alone, or may be part of a prime and boost 
5 administration regimen. A mixed modality priming and booster inoculation scheme 
will result in an enhanced immune response, particularly is pre-existing anti-vector 
immune responses are present. This one aspect of this invention is a method of 
priming a subject with the plasmid vaccine by administering the plasmid vaccine at 
least one time, allowing a predetermined length of time to pass, and then boosting by 

10 administering the adenoviral vaccine. Multiple primings typically, 1-4, are usually 
employed, although more may be used. The length of time between priming and 
boost may typically vary from about four months to a year, but other time frames may 
be used. In experiments with rhesus monkeys, the animals were primed four times 
with plasmid vaccines, then were boosted 4 months later with the adenoviral vaccine. 

15 Their cellular immune response was notably higher than that of animals which had 
only received adenoviral vaccine. The use of a priming regimen may be particularly 
preferred in situations where a person has a pre-existing anti-adenovirus immune 
response. 

Furthermore and in the alternative, multiple HTV-l viral antigens, such as the 

20 MRKAdS adenoviral vaccines disclosed herein, may be ligated into a proper shuttle 
plasmid for generation of a pre-adenoviral plasmid comprising multiple open reading 
frames. For example a trivalent vector may comprise a gag-pol-nef fusion, in either a 
E3(-) or E3(+) background, preferably a E3 deleted backbone, or possible a "2+1" 
divalent vaccine, such as a gag-pol fusion (i.e., codon optimized p55 gag and 

25 inactivated optimized pol; Example 29 and Table 25) within the same MRKAdS 
backbone, with each open reading frame being operatively linked to a distinct 
promoter and transcription termination sequence. Alternatively, the two open reading 
firames may be operatively linked to a single promoter, with the open reading frames 
operatively linked by an internal ribosome entry sequence (IRES), as disclosed in 

30 International Publication No. WO 95/24485, which is hereby incorporated by 
reference. Figure 9 shows that the use of multiple promoters and termination 
sequences provide for similar growth properties, while Figure 28 shows that these 
MRKAd5gag-'based vectors are also stable at least through passage 21. In the absence 
of the use of IRES-based technology, it is preferred that a distinct promoter be used to 

35 support each respective open reading frame, so as to best preserve vector stability. As 
examples, and certainly not as limitations, potential multiple transgene vaccines may 
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include a three transgene vector such as hCMV-gagpol-bGHpA + mCMV-nef-SPA in 
an E3 deleted backbone or hCMV-gagpol-bGHpA + mCMV-nef-SPA(E3+). 
Potential "2+1" divalent vaccines of the present invention might be a hCMV-gag- 
bGHpA + mCMV-nef-SPA in an E3+ backbone (vector #1) in combination with 

5 hCMV-pol-bOHpA in an E3+ backbone (vector #2), with all transgenes in the El 
parallel orientation. Fusion constructs other than the gag-pol fusion described above 
are also suitable for use in various divalent vaccine strategies and can be composed of 
any two HIV antigens fused to one another (e.g.,, nef-pol and gag-nef). These 
adenoviral compositions are, as above, preferably delivered along with an adenoviral 

10 composition comprising an additional HTV antigen in order to diversify the immune 
response generated upon adnainistration. Therefore, a multivalent vaccine delivered 
in a single, or possible second, adenoviral vector is certainly contemplated as part of 
the present invention. Again, this mode of administration is another example of 
whereby an efficaceous adenovirus-based HIV-1 vaccine may be administered via a 

15 combined modality regime. It is important to note, however, that in terms of deciding 
on an insert for the disclosed adenoviral vectors, due consideration must be dedicated 
to the effective packaging limitations of the adenovirus vehicle. Adenovirus has been 
shown to exhibit an upper cloning capacity limit of approximately 105% of the 
wildtype Ad5 sequence. 

20 Regardless of the gene chosen for expression, it is preferred that the sequence 

be "optimized" for expression in a human cellular environment. A "triplet" codon of 
four possible nucleotide bases can exist in 64 variant forms. That these forms provide 
the message for only 20 different amino acids (as well as transcription initiation and 
tennination) means that some amino acids can be coded for by more than one codon. 

25 Indeed, some amino acids have as many as six "redundant", alternative codons while 
some others have a single, required codon. For reasons not completely understood, 
alternative codons are not at all uniformly present in the endogenous DNA of 
differing types of cells and there appears to exist variable natural hierarchy or 
"preference" for certain codons in certain types of cells. As one example, the amino 

30 acid leucine is specified by any of six DNA codons including CTA, CTC, CTG, CTT, 
. TTA, and TTG (which correspond, respectively, to the mRNA codons, CUA, CUC, 
CUG, CUU, UUA and UUG). Exhaustive analysis of genome codon frequencies for 
microorganisms has revealed endogenous DNA ofE. coli most conmionly contains 
the CTG leucine-specif ying codon, while the DNA of yeasts and slime molds most 

35 commonly includes a TTA leucine-specifying codon. In view of this hierarchy, it is 
generally held that the likelihood of obtaining high levels of expression of a leucine- 
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rich polypeptide by an E. coli host will depend to some extent on the frequency of 
codon use. For example, a gene rich in TTA codons will in all probability be pooriy 
expressed in E, coli, whereas a CTG rich gene will probably highly express the 
polypeptide. Similarly, when yeast cells are the projected transformation host cells 
5 for expression of a leucine-rich polypeptide, a preferred codon for use in an inserted 
DNA would be TTA. 

The implications of codon preference phenomena on recombinant DNA 
techniques are manifest, and the phenomenon may serve to explain many prior 
failures to achieve high expression levels of exogenous genes in successfully 

10 transformed host organisms— a less "preferred" codon may be repeatedly present in the 
inserted gene and the host cell machinery for expression may not operate as 
efficiently. This phenomenon suggests that synthetic genes which have been designed 
to include a projected host cell's preferred codons provide a preferred form of foreign 
genetic material for practice of recombinant DNA techniques. Thus, one aspect of 

15 this invention is an adenovirus vector or adenovirus vector in some combination with 
a vaccine plasmid where both specifically include a gene which is codon optimized 
for expression in a human cellular environment. As noted herein, a preferred gene 
for use in the instant invention is a codon-optimized HIV gene and, particularly, HIV 
gag, pol or nef 

20 Adenoviral vectors in accordance with the instant invention can be constructed 

using known techniques, such as those reviewed in Hitt et al, 1997 "Human 
Adenovirus Vectors for Gene Transfer into Mammalian Cells" Advances in 
Pliarmacology 40:137-206, which is hereby incorporated by reference. 

In constructing the adenoviral vectors of this invention, it is often convenient 

25 to insert them into a plasmid or shuttle vector. These techniques are known and 
described in Hitt et al., supra. This invention specifically includes both the 
adenovims and the adenovirus when inserted into a shuttle plasmid. 

Preferred shuttle vectors contain an adenoviral portion and a plasmid portion. 
The adenoviral portion is essentially the same as the adenovirus vector discussed 

30 supra, containing adenoviral sequences (with non-functional or deleted El and E3 

regions) and the gene expression cassette, flanked by convenient restriction sites. The 
plasmid portion of the shuttle vector often contains an antibiotic resistance marker 
under transcriptional control of a prokaryotic promoter so that expression of the 
antibiotic does not occur in eukaryotic cells. Ampicillin resistance genes, neomycin 

35 resistance genes and other pharmaceutically acceptable antibiotic resistance markers 
may be used. To aid in the high level production of the polynucleotide by 
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fermentation in prokaryotic organisms, it is advantageous for the shuttle vector to 
contain a prokaryotic origin of replication and be of high copy number. A number of 
conmiercially available prokaryotic cloning vectors provide these benefits. It is 
desirable to remove non-essential DNA sequences. It is also desirable that the vectors 

5 not be able to replicate in eukaryotic cells. This minimizes the risk of integration of 
polynucleotide vaccine sequences into the recipients' genome. Tissue-specific 
promoters or enhancers may be used whenever it is desirable to Hmit expression of the 
polynucleotide to a particular tissue type. 

In one embodiment of this invention, the pre-plasmids (e.g., pMRKAdSpol, 

10 pMRKAdSnef and pMRKAdSgag were generated by homologous recombination 
using the MRKHVE3 (and MRKHVO for the E3- version) backbones and the 
appropriate shuttle vector, as shown for pMRKAdSpol in Figure 22 and for 
pMRKAdSnef in Figure 23. The plasraid in linear form is capable of replication after 
entering the PER,C6® cells and virus is produced. The infected cells and media were 

15 harvested after viral replication was complete. 

Viral vectors can be propagated in various El complementing cell Unes, 
including the known cell lines 293 and PER.C6®. Both these cell Unes express the 
adenoviral El gene product. PER.C6® is described in WO 97/00326 (published 
January 3, 1997) and issued U.S. Patent No. 6,033,908, both of which are hereby 

20 incorporated by reference. It is a primary human retinoblast cell line transduced with 
an El gene segment that complements the production of replication deficient (FG) 
adenovims, but is designed to prevent generation of replication competent adenovirus 
by homologous recombination. Cells of particular interest have been stably 
transformed with a transgene that encodes the AD5E1 A and ElB gene, like PER.C6®, 

25 from 459 bp to 3510 bp inclusive. 293 cells are described in Graham et al., 1977 J. 
Gen, Virol 36:59-72, which is hereby incorporated by reference. As stated above, 
consideration must be given to the adenoviral sequences present in the 
complementing cell line used. It is important that the sequences not overlap with that 
present in the vector if the possibility of recombination is to be minimized. 

30 It has been found that vectors generated in accordance with the above 

description are more, effective in inducing an immune response and, thus, constitute 
very promising vaccine candidates. More particularly, it has been found that first 
generation adenoviral vectors in accordance with the above description carrying a 
codon-optimized HTV gag gene, regulated with a strong heterologous promoter can be 

35 used as human anti-HIV vaccines, and are capable of inducing immune responses. 
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Standard techniques of molecular biology for preparing and purifying DNA 
constructs enable the preparation of the DNA immunogens of this invention. 

A vaccine composition comprising an adenoviral vector in accordance with 
the instant invention may contain physiologically acceptable components, such as 
5 buflFer, normal saline or phosphate buffered saline, sucrose, other salts and 

polysorbate. One preferred formulation has: 2.5-10 mM TRIS buffer, preferably 
about 5 mM TRIS buffer; 25-100 mM NaCl, preferably about 75 mM NaCl; 2.5-10% 
sucrose, preferably about 5% sucrose; 0.01 -2 mMMgCh; andO.001%-0.01% 
polysorbate 80 (plant derived). The pH should range from about 7.0-9.0, preferably 

10 about 8.0. One skilled in the art will appreciate that other conventional vaccine 
excipients may also be used it make the formulation. The preferred formulation 
contains SmMTMS, 75 mM NaCl, 5% sucrose, ImMMgCla, 0.005% polysorbate 80 
at pH 8.0 This has a pH and divalent cation composition which is near the optimum 
for Ad5 stability and mmimizes the potential for adsorption of virus to a glass surface. 

15 It does not cause tissue irritation upon intramuscular injection. It is preferably frozen 
until use. 

The amount of adenoviral particles in the vaccine composition to be 
introduced into a vaccine recipient will depend on the strength of the transcriptional 
and translational promoters used and on the immunogenicity of the expressed gene 

20 product. In general, an immunologically or prophylactically effective dose of 1x10^ 
to 1x10^^ particles and preferably about 1x10^^ to 1x10*^ particles is administered 
directly into muscle tissue. Subcutaneous injection, intradermal introduction, 
impression through the skin, and other modes of administration such as 
intraperitoneal, intravenous, or inhalation delivery are also contemplated. It is also 

25 contemplated that booster vaccinations are to be provided. Following vaccination 

with mv adenoviral vector, boosting with a subsequent HIV adenoviral vector and/or 
plasmid may be desirable. Parenteral administration, such as intravenous, 
intramuscular, subcutaneous or other means of administration of interleuldn-12 
protein, concurrently with or subsequent to parenteral introduction of the vaccine 

30 compositions of this invention is also advantageous. 

The adenoviral vector and/or vaccine plasmids of this invention 
polynucleotide may be unassociated with any proteins, adjuvants or other agents 
which impact on the recipients' immune system. In this case, it is desirable for the 
vector to be in a physiologically acceptable solution, such as, but not limited to, sterile 

35 saline or sterile buffered saline. Alternatively, the vector may be associated with an 
adjuvant known in the art to boost immune responses (i.e., a "biologically effective" 
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adjuvant), such as a protein or other carrier. Vaccine plasmids of this invention may, 
for instance, be delivered in saline (e.g., PBS) with or without an adjuvant Preferred 
adjuvants are Alum or CRL1005 Block Copolymer. Agents which assist in the 
cellular uptake of DNA, such as, but not limited to, calcium ions, may also be used to 
5 advantage. These agents are generally referred to herein as transfection facilitating 
reagents and phannaceutically acceptable carriers. Techniques for coating 
microprojectiles coated with polynucleotide are known in the art and are also useful in 
connection with this invention. 

This invention also includes a prime and boost regimen wherein a first 

10 adenoviral vector is administered, then a booster dose is given. The booster dose may 
be repeated at selected time intervals. Alternatively, a preferred inoculation scheme 
comprises priming with a first adenovirus serotype and then boosting with a second 
adenovirus serotype. More preferably, the inoculation scheme comprises priming 
with a jBrst adenovirus serotype and then boosting with a second adenovirus serotype, 

15 wherein the first and second adenovirus serotypes are classified within separate 

subgroups of adenoviruses. The above prime/boost schemes are particularly preferred 
m those situations where a preexisting immunity is identified to the adenoviral vector 
of choice. In this type of scheme, the individual or population of individuals is 
primed with an adenovirus of a serotype other than that to which the preexisting 

20 immunity is identified. This enables the first adenovirus to effectuate sufficient 

expression of the transgene while evading existing immunity to the second adenovirus 
(the boosting adenovirus) and, further, allows for the subsequent delivery of the 
transgene via the boosting adenovirus to be more effective. Adenovirus serotype 5 is 
one example of a virus to which such a scheme might be desirable. In accordance 

25 with this invention, therefore, one might decide to prime with a non-group C 

adenovirus (e.g., Adl2, a group A adenovirus, Ad24, a group D adenovirus, or Ad35, 
a group B adenovirus) to evade anti-Ad5 immunity and then boost with Ad5, a group 
C adenovirus. Another preferred embodiment involves administration of a different 
adenovirus (including non-human adenovirus) vaccine followed by administration of 

30 the adenoviral vaccines disclosed. In the alternative, a viral antigen of interest can be 
first delivered via a viral vaccine other than an adenovirus-based vaccine, and then 
followed with die adenoviral vaccine disclosed. Alternative viral vaccines include but 
are not limited to pox virus and Venezuelan equine encephilitis virus. 

A large body of human and animal data supports the importance of cellular 

35 inmiune responses, especially CTL in controlling (or eliminating) HTV infection. In 
humans, very high levels of CTL develop following primary infection and correlate 
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with the control of viremia. Several small groups of individuals have been described 
who are repeatedly exposed to HIV by remain uninfected; CTL has been noted in 
several of these cohorts. In the SIV model of HIV infection, CTL similarly develops 
following primary infection, and it has been demonstrated that addition of anti-CD8 
5 monoclonal antibody abrogated this control of infection and leads to disease 

progression. This invention uses adenoviral vaccines alone or in combination with 
plasmid vaccines to induce CTL. 

The following non-limiting Examples are presented to better illustrate the 
invention. 

10 

EXAMPLE 1 

Removal of the Intron A Portion of the hCMV Promoter 
GMP grade pVUnsHIVgag was used as the starting material to amphfy the 
hCMV promoter. PVUnsHIVgag is a plasmid comprising the CMV inmaediate-early 

15 (IE) promoter and intron A, a full-length codon-optimized HIV gag gene, a bovine 
growth hormone-derived polyadenylation and transcriptional termination sequence, 
3 minimal pUC backbone; see Montgomery et al^ supra for a description of the 
plasmid backbone. The amplification was performed with primers suitably positioned 
to flank the hCMV promoter. A 5' primer was placed upstream of the Mscl site of 

20 the hCMV promoter and a 3' primer (designed to contain the BglR recognition 

sequence) was placed 3' of the hCMV promoter. The resulting PGR product (using 
high fidelity Tag polymerase) which encompassed the entire hCMV promoter (minus 
intron A) was cloned into TOPO PCR blunt vector and then removed by double 
digestion with Mscl and Bg/II. This fragment was then cloned back into the original 

25 GMP grade pVUnsHIVgag plasmid from which the original promoter, intron A, and 
the gag gene were removed following Mscl and BglK digestion. This ligation 
reaction resulted in the construction of a hCMV promoter (minus intron A) + bGHpA 
expression cassette within the original pVUnsHIVgag vector backbone. This vector 
is designated pVIJnsCMV(no intron). 

30 The FLgag gene was excised from pVUnsHIVgag using BglE digestion and 

the 1,526 bp gene was gel purified and cloned into pVlJnsCMV(no intron) at the 
BglR site. Colonies were screened using Smal restriction enzymes to identify clones 
that carried the Flgag gene in the correct orientation. This plasmid, designated 
pVlJnsCMV(no intron)-FLgag-bGHpA, was fully sequenced to confirm sequence 

35 integrity. 
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Two additional transgenes were also constructed. The plasmid, 
pVl JnsCMV(no intron)-ELgag-SPA, is identical to pVlJnsCMV(no intron>FLgag- 
bGHpA except that the bovine growth hormone polyadenylation signal has been 
replaced with a short synthetic polyA signal (SPA) of 50 nucleotides in length. The 
5 sequence of the SPA is as shown, with the essential components (poly(A) site, (GT)n, 
and (T)a; respectively) underlined: 

AATAA A AGATCTTTATTTTCATTAGATC TGTGTG TTGGTTTTTTGTGTG 
(SEQIDNO:18). 

The plasmid, pVlJns-mCMV-FLgag-bGHpA, is identical to the 
10 pVlJnsCMV(no intron)-FLgag-bGHpA except that the hCMV promoter has been 
removed and replaced with the murine CMV (mCMV) promoter. 

Figure 3 diagranmiatically shows the new transgene constructs in comparison 
with the original transgene. 

15 EXAMPLE 2 

Gag Expression Assay for Modified Gag Transgenes 
Gag Elisa was performed on culture supematants obtained from transient 
tissue culture transfection experiments in which the two new hCMV-containing 
plasmid constructs, pVlJnsCMV(no intron)-FLgag-bGHpA and pVlJnsCMV(no 

20 intron)-FLgag-SPA, both devoid of intron A, were compared to pVlJnsHIVgag 

which, as noted above possesses the intron A as part of the hCMV promoter. Table 2 
below shows the in vitro gag expression data of the new gag plasmids compared with 
the GMP grade original plasmid. The results displayed in Table 2 show that both of 
the new hCMV gag plasmid constructs have expression capacities comparable to the 

25 original plasmid construct which contains the intron A portion of the hCMV 
promoter. 
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Table 2: In vitro DNA transfection of oiigina 


and new plasmid HIV-1 gag constructs. 


Plasmid 


llg gag/10e6 COS cells/5ug DNA/48 hr 


HIVFL-gagPR9901' 


10.8 


PVDns-hCMV-ELgag-bGHpA" 


16.6 


pVlJns-hCMV-FLgag-SPA"** 


12.0 



^GMP grade pVlJns-hCMVintronA-FLgag-bGHpA. 
5 ^ New plasmid constructions that have the intron A portion removed from the hCMV 
promoter. 

^ Jn this construct the bGH terminator has been replaced with the short synthetic 
polyadenylation signal (SPA) 



10 EXAMPUE 3 

Rodent (Balb/c) Study for Modified gag Transgenes 
A rodent study was performed on the two new plasmid constmcts 

described above - pVl JnsCMV(no intron)-FLgag-bGHpA and pVl JnsCMV(no 

intron)-FLgag-SPA - in order to compare them with the construct described above 
15 possessing the intron A portion of the CMV promoter, pVlJnsHIVgag. Gag antibody 

and Elispot responses (described in PCT International Application No. 

PCT/USOO/18332 (WO 01/02607) filed July 3, 2000, claiming priority to U.S. 

Provisional Application Serial No. 60/142,631, filed July 6, 1999 and U.S. 

Application Serial No. 60/148,981, filed August 13, 1999, all three applications which 
20 are hereby incorporated by reference) were measured. The results displayed in Table 

3 below, show that the new plasmid constructs behaved equivalently to the original 

construct in Balb/c mice with respect to their antibody and T-cell responses at both 

dosages of plasmid DNA tested, 20 |Xg and 200 ^g. 
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EXAMPUE4 

Table 3: HIV191 : Immunogenicity of VlJns-gag under different promoter and termination control 
elements. 



DNA° 

Promoter/terminator 


Dose, 
ug' 


Antl-p24 Titers 
(3 Wk PDI)"" 


SFC/IC^e Cells 
(4 Wk PDI)** 


GMT 


4SE 


-SE 


Media 


gagi 97-205 


p24 


HIVFL>gagPR9901 


200 


12800 


4652 


3412 


2(2) 


129(19) 


30(11) 


(GMP grade) 


20 


5572 


1574 


1227 


0 


56(9) 


25(6) 


pVUns-hCMV- 


200 


11143 


2831 


2257 


0 


98(5) 


12(6) 


FL-gag-bGHpA 


20 


7352 


2808 


2032 


0 


73(9) 


11(6) 


pVUns-hCMV- 


200 


16890 


5815 


4326 


1(1) 


94(4) 


26(7) 


FL-gag-SPA 


20 


5971 


5361 


2825 


0 


85(17) 


38(10) 


Najve 


0 


123 


50 


36 


0 


0 


0 



*in PBS 

^.m. Injections into both quads, 50 per quad 
°n=10;GMT, geometric mean titer; SE, standard, error 

^ns5, pooled spleens; mean of triplicate wells and standard, deviation, in parentheses; 



Construction of the Modified Shuttle Vector -"MRKpdelEl Shuttle" 
The modifications to the original Ad5 shuttle vector (pdelElsplA; a vector 
comprising Ad5 sequences from basepairs 1-341 and 3524-5798, with a multiple 

5 cloning region between nucleotides 341 and 3524 of Ad5, included the following 
three manipulations carried out in sequential cloning steps as follows: 
(1) The left ITR region was extended to include the Pad site at the junction between 
the vector backbone and the adenovirus left ITR sequences. This allow for easier 
manipulations using the bacterial homologous recombination system. 

10 (2) The packaging region was extended to include sequences of the wild-type (WT) 
adenovirus from 342 bp to 450 bp inclusive. 

(3) The area downstream of pIX was extended 13 nucleotides (i.e., nucleotides 3511- 
3523 inclusive). 

These modifications (Figure 4) effectively reduced the size of the El deletion without 
15 overlapping with any part of the El A/EIB gene present in the transformed PER.C6® 
cell line. .AH manipulations were performed by modifying the Ad shuttle vector 
pdelElsplA. 

Once the modifications were made to the shuttle vector, the changes were 
incorporated into the original Ad5 adenovector backbones (pAdHVO and pAdHVE3) 
20 by bacterial homologous recombination using E. coli BJ5183 chemically competent 
cells. 
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EXAMPLES 

CoDstruction of Modified Adenovector Backbones rE3+ and 
The original adenovectors pAdHVO (comprising all Ad5 sequences except 
those nucleotides encompassing the El andE3 regions ) and pADHVE3 (comprising 
5 all Ad5 sequences except those nucleotides encompassing the El region), were each 
reconstructed so diat they contained the modifications to the El region. This was 
accomplished by digesting the newly modified shuttle vector (MRKpdelEl shuttle) 
with Pad and BstZllOl and isolating the 2,734 bp fragment which corresponds to the 
adenovirus sequence. This fragment was co-transformed with DNA from either Clal 

10 linearized pAdHVO CE3- adenovector) or Clal linearized pAdHVE3 

(E3+adenovector) into E. coli BJ5183 competent cells. At least two colonies from 
each transformation were selected and grown in Terrific™ broth for 6-8 hours until 
turbidity was reached. DNA was extracted from each cell pellet and then transformed 
into E. coli XLl competent cells. One colony from each transformation was selected 

15 and grown for plasmid DNA purification. The plasmid was analyzed by restriction 
digestions to identify correct clones. The modified adenovectors were designated 
^^KpAdHVO (E3- plasmid) and MRKpAdHVE3 (E3+ plasmid). Virus from these 
new adenovectors (MRKHVO and MRKHVE3, respectively) as well as the old 
version of the adenovectors were generated in the PER.C6® cell lines to acconraiodate 

20 the following series of viral competition experiments. In addition, the multiple 
cloning site of the original shuttle vector contained Clal , BamHI, Xho I, EcoRV, 
Hindni, Sal I, and Bgl II sites. This MCS was replaced with a new MCS containing 
Not I, Cla I, EcoRV and Asc I sites. This new MCS has been transferred to the 
MRKpAdHVO and MRKpAdHVE3 pre-plasmids along with the modification made 

25 to the packaging region and pIX gene. 

EXAMPLE6 

Analvsis of the Effect of the Packaging Signal Extension 
To study the effects of the modifications made to the El deletion region, the 
30 viruses obtained from the original backbone (pAdHVE3) and the new backbone 
(MRKpAdHVE3) were mixed together in equal MOI ratios (1:1 and 5:5) and 
passaged through several rounds; see Figure 5, Expt.#l. Both of the viruses in the 
experiment contained the E3 gene intact and did not contain a transgene. The only 
difference between the two viruses was within the region of the El deletion. 
35 Following the coinfection of the viruses at PI (passage 1), the mixtures were 

propagated through an additional 4 passages at which time the cells were harvested 
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and the virus extracted and purified by CsCl banding. The viral DNA was extracted 
and digested with HindUL and the digestion products were then radioactively labeled. 
For the controls, the respective pre-plasmids (pAdHVE3 ("OLD E3+"); 
MRKpAdHVES ("NEW E3+")) were also digested with HiJidSl (and Pad to remove 
5 the vector backbone) and subsequently labeled with [^^P]dATP. The radioactively 
labeled digestion products were subjected to gel electrophoresis and the gel was dried 
down onto Whatman paper before being exposed to autoradiographic film. Figure 6 
clearly shows that the new adenovirus which has the addition made to the packaging 
signal region has a growth advantage compared with the original adenovirus. In the 
10 experiments performed (at either ratio tested), only the digestion bands pertaining to 
the newly modified virus were present. The diagnostic band of size 3,206 (from the 
new virus) was clearly present. However, there was no evidence of the diagnostic 
band of size 2,737 bp expected from the original virus. 

15 EXAMPLE 7 

Analvsis of the Effect of the E3 Gene 
The second set of the virus competition study involved mixing equal MOI 
ratio (1:1) of the newly modified viruses, that obtained from MRKpAdHVO and 
MRKpAdHVE3 (Figure 5, Expt. #2), In this set, both viruses had the new 

20 modifications made to the El deletion. The first virus (that from MRKpAdHVO) 
does not contain an E3 gene. The second virus (that from MRKpAdHVE3) does 
contain the E3 gene. Neither of the viruses contain a transgene. Following co- 
infection of the viruses, the mixtures were propagated through an additional 4 
passages at which time the cells were harvested and the total virus extracted and 

25 purified by CsCl banding. The viral DNA was extracted and digested with HineSSL 
and the digestion products were then radioactively labeled. For the controls, the 
respective pre-plasmids MRKpAdHVO ("NEW E3-"); MRKpAdHVE3 ("NEW 
E3+") were also digested with HincLWl (and Pad to remove the vector backbone) and 
then labeled with [33p]dATP. The radioactively labeled digestion products were 

30 subjected to gel electrophoresis and the gel was dried down onto Whatman paper 

before being exposed to autoradiographic film. Figure 6 shows the results of the viral 
DNA analysis of the E3+ virus and E3- virus mixing experiment. The diagnostic 
band coiresponding to the E3+ virus (5,665 bp) was present in greater amount 
compared with the diagnostic band of 3,010 bp corresponding to the E3- virus. This 

35 ' indicates that the virus that contains the E3 gene is able to amplify more rapidly 
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compared with the vims that does not contam an E3 gene. This increased 
amplification capacity has been confirmed by growth studies; see Table 4 below. 

EXAMPLES 

Construction of the new shuttle vector containing modified gag transgene - 
"MRKndelEl^CMVfno intronVFLgag-bGHpA" 
The modified plasmid pVlJnsCMV(no intron)-FLgag-bGHpA was digested 
with Mscl overnight and then digested with Sfil for 2 hours at SO^'C. The DNA was 
then treated with Mungbean nuclease for 30 mins at 30°C. The DNA mixture was 
desalted using the Qiaex n kit and then Klenow treated for 30 mins at 3TC to fully 
blunt the ends of the transgene fragment. The 2,559 bp transgene fragment was then 
gel purified. The modified shuttle vector (MRKpdelEl shuttle) was linearized by 
digestion with EcoRV, treated with calf intestinal phosphatase and the resulting 6,479 
bp fragment was then gel purified. The two purified fragments were then ligated 
together and several dozen clones were screened to check for insertion of the 
transgene within the shuttle vector. Diagnostic restriction digestion was performed to 
Hentify those clones carrying the transgene in the El parallel and El anti-parallel 
orientation. This strategy was followed to clone in the other gag transgenes in the 
MRKpdelEl shuttie vector. 

EXAMPLE 9 
Construction of the MRK FG Adenovectors 
The shuttle vector containing the KV-l gag transgene in the El parallel 
orientation, MRKpdelEl -CMV(no intron)-FLgag-bGHpA, was digested with PacL 
The reaction mixture was digested with BsfZlll. The 5,291 bp fragment was purified 
by gel extraction. The MRKpAdHVES plasmid was digested witfi Clal overnight at 
37°C and gel purified. About 100 ng of tiie 5,290 bp shuttie +transgene fiagment and 
-100 ng of linearized MIlKpAdHVE3 DNA were co-transformed into E. coli BJ5183 
chemically competent cells. -Several clones were selected and grown in 2 ml 
Terrific™ broth for 6-8 hours, until turbidity was reached. The total DNA from the 
cell pellet was purified using Qiagen alkaline lysis and phenol chloroform method. 
The DNA was precipitated with isopropanol and resuspended in 20 fJil dH20. A 2 pil 
aliquot of this DNA was transformed into E, coli XL-1 competent cells. A single 
colony from each separate transformation was selected and grown overnight in 3 ml 
LB +100 |Lig/ml ampicillin. The DNA was isolated using Qiagen colunms, A positive 
clone was identified by digestion with tiie restriction enzyme BsiEJl which cleaves 
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within the gag gene as well as the plasmid backbone. The pre-plasmid clone is 
designated MRKpAdHVE3+CMV(no intron)-FLgag-bGHpA and is 37,498 bp in size. 
This strategy was followed to generate E3- and E3+ versions of each of the other gag 
transgene constructions in both El parallel and El anti-parallel versions. Figures 7A, 
5 7B and 7C show the various combinations of adenovectors constructed. 

EXAMPLE 10 
Plasmid Competition Studies 
A series of plasmid competition studies was carried out. Briefly, the screening 

10 of the various combinations of new constructs was performed by mixing equal 

amounts of each of two competing plasmids. In the experiment shown in Figure 8 A, 
plasmids containing the same transgene but in different orientations were mixed 
together to create a "competition" between the two plasmids. The aim was to look at 
the effects of transgene orientation. In the experiment shown in Figure 8B, plasmids 

15 containing different polyadenylation signals (but in the same orientation) were mixed 
together in equal amounts. The aim was to assess effects of poly A signals. Following 
the initial transfection, the virus was passaged through ten rounds and the viral DNA 
analyzed by radioactive restriction analysis. 

Analysis of the viral species from the plasmid mixing experiment (Figure 8A) 

20 showed that adenovectors which had the transgene inserted in the El parallel 

orientation amplified better and were able to out-compete the adenovirus which had 
the transgene inserted in the El anti-parallel orientation. Viral DNA analysis of the 
mixtures at passage 3 and certainly at passage 6, showed a greater ratio of the virus 
carrying the transgene in the El parallel orientation compared with the El antiparallel 

25 version. By passage 10, the only viral species observed was the adenovector with the 
transgene in the El parallel orientation for both transgenes tested (hCMV(no intron)- 
FLgag-bGHpA and hCMV(no intron)-FLgag-SPA). 

Analysis of the viral species from the plasmid mixing experiment #2 (Figure 
8B) at passages 3 and 6 showed that the polyadenylation signals tested (bGHpA and 

30 SPA) did not have an effect on the growth of the virus. Even at passage 10 the two 
viral species in the mixture were still present in equal amounts. 
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EXAMPLE 11 

Vims generation of an enhanced adenoviral construct - "MRK Ad5 HTV-lgag" 
The results obtained from the competition study allowed us to make the 
following conclusions: (1) The packaging signal extension is beneficial; (2) Presence 
of E3 does enhance viral growth; (3) El parallel orientation is recommended; and (4) 
PolyA signals have no effect on the growth of the adenovirus. 

MRK Ad5 HTV-l gag exhibited the most desirable results. This construct 
contains the hCMV(no intron)-FLgag-bGHpA transgene inserted into the new E3+ 
adenovector backbone, MRKpAdHVE3, in the El parallel orientation. We have 
designated this adenovector MRK Ad5 HIV-1 gag. This constract was prepared as 
outlined below; 

The pre-plasmid MRKpAdHVE3+CMV(no intron)-FLgag-bGHpA was 
digested was Pad to release the vector backbone and 3.3 jxg was transfected by 
calcium phosphate method (Amersham Pharmacia Biotech.) in a 6 cm dish containing 
PER.C6® cells at -60% confluence. Once CPE was reached (7-10 days), the culture 
was Iteeze/thawed three times and the cell debris pelleted. 1 ml of this cell lysate was 
"*5ed to infect into a 6 cm dish containing PER.C6® cells at 80-90% confluence. Once 
CPE was reached, the culture was freeze/thawed three times and the cell debris 
pelleted. The cell lysate was then used to infect a 15 cm dish containing PER.C6® 
cells at 80-90% confluence. This infection procedure was continued and expanded at 
passage 6. The virus was then extracted from the cell pellet by CsCl method. Two 
bandings were performed (3-gradient CsCl followed by a continuous CsCl gradient). 
Following the second banding, the virus was dialyzed in A105 buffer. Viral DNA 
was extracted using pronase treatment followed by phenol chloroform. The viral 
DNA was then digested with HindDI and radioactively labeled with [33p]dATP. 
Following gel electrophoresis to separate the digestion products the gel was dried 
down on Whatman paper and then subjected to autoradiography. The digestion 
products were compared with the digestion products from the pre-plasmid (that had 
been digested with PacllHindm prior to labeling). The expected sizes were 
observed, indicating that the virus had been successfully rescued. This strategy was 
used to rescue virus from each of the various adenovector plasmid constructs 
prepared. 
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EXAMPLE 12 
Stability Analyses 

To determine whether the various adenoyector constructs (e.g., MRK Ad5 
HIV-1 gag) show genetic stability, the yiruses were each passaged continually. The 
5 viral DNA was analyzed at passages 3, 6 and 10. Each virus maintained its correct 
genetic structure. In addition, the stability of the MRK Ad5 HTV-l gag was analyzed 
under propagation conditions similar to that performed in large scale production. For 
this analysis, the transfections of MRK Ad5 HIV-1 gag as well as three other 
adenoviral vectors were repeated and the virus was purified at P3. The three other 

10 adenovectors were as follows: (1) that comprising hCMV(no intron)-Figag with a 
bGHpA terminator in an E3- adenovector backbone; (2) that comprising hCMV(no 
intron)-Rgag with a SPA tenoiination signal in an E3+ adenovector backbone, and 
that comprising a mCMV-Flgag with a bGHpA tenninator in an E3+ adenovector 
backbone. All of the vectors have the transgene inserted in the El parallel orientation. 

15 Viral DNA was analyzed by radioactive restriction analysis to confirm that it was 
correct before being delivered to fermentation cell culture for continued passaging in 
^erum-free media. At P5 each of the four viruses were purified and the viral DNA 
extracted for analysis by the restriction digestion and radiolabeUng procedure. This 
virus has subsequently been used in a series of studies (in vitro gag expression in COS 

20 cells, rodent study and rhesus monkey study) as will be described below. The viruses 
from P5 are shown in Figure 9. 

The passaging under serum-free conditions was continued for the 
MRKIIVE3 (transgene-less, obtained from MRKpAdHVE3 pie-plasmid) and the 
MRKAdSHIV-lgag (obtained from MRKpAdHVE3+CMV(no intron)-FLgag- 

25 bGHpA pre-plasmid) viruses. Figure 10 shows viral DNA analysis by radioactive 
restriction digestion at passage 11 for MRKHVE3, MRKAdSHTV-lgagES-, and 
passage 11 and 12 for MRKAd5HIV-lgag. Aside from the first lane which is the 
DNA marker lane, the next three lanes are virus from the pre-plasmid controls 
(controls based on the original virus) - MRKpAdHVE3 (also referred to as 

30 "pMRKHVE3"), MRKpAdHVE3+CMV(no intron)-FLgag-bGHpA, and 

pMRKAd5gag(E3-), respectively. As seen in Figure 10, each of the viral DNA 
samples show the expected bands with no extraneous bands showing. This signifies 
that there are no major variant adenovirus species present that can be detected by 
autoradiography. 

35 Figure 1 1 shows the results of viral competition study between MRKHVE3 

and MRKAd5HIV-lgag. These viruses were mixed together at equal MOT (140 viral 



46 



wo 02/22080 



PCTAJSOl/28861 



particles each; 280 vp total) at passage 6 and continued to be passaged until Pll. 
Aside from the first lane which is the DNA marker lane, the next two lanes are the 
pre-plasmid controls obtained fi:om MRKpAdHVE3 and MRKpAdHVE3+CMV(no 
intron)-ELgag-bGHpA. The next two lanes are the viral DNA from the starting viral 
5 material at passage six. The last two lanes are the competition studies performed in 
duplicate. The data in Figure 11 shows the effect the gag transgene in culture. 
Growth of a MRELAdSgag vims was compared with growth of a "transgene-less" 
MRKHVE3, These two viruses were infected at the same MOI (i.e. 140 vp each) at 
passage 6 and then passaged through to passage 11 and the viral pool was analyzed by 
10 radioactive restriction analysis. The data shows that one virus did not out compete the 
other. Therefore, the gag transgene did not show obvious signs of toxicity to the 
adenovirus. 

Analysis by Hindni digestion shows that each virus specie is present in 
approximately equal amounts. As above, there does not appear to be signs of any 
15 extraneous bands. Figure 12 shows higher passage numbers for MRKAd5HIV-lgag 
grown under serum-containing conditions. The genome integrity again has been 
nnaintained and there is no evidence of rearrangements, even at the highest passage 
level (P21). 

Each of the four vectors shown in Figure 9 were analyzed for amplification 
20 capacity. Table 4 below shows the QPA analysis used in the estimation of viral 

amplification ratios at P4. The determination of the araphfication ratio for the 

original HTV-l gag construct is based on the clinical lot at P12. It has been shown 

that ampUfication rates increases with higher passage number for the original virus. 

The reason for this observation is due to the emergence of variants which exhibit 
25 increased growth rates compared to the intact adenovector. With continued passaging 

of the original Ad gag vector, the level of variants increases and hence amplification 

rates increase also. 

The MRK Ad5 HIV-1 gag virus has also been continually passaged under 

process conditions (i.e., serum-free media). Viral DNA extracted from passages 11 
30 and 12 show no evidence of rearrangement. 
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Table 4: 

Amplification Ratios Based on AEX and QPA Analysis of 
Virus Amplification from Passage 3 to Passage 4. 



Ad gag construct 


Amplification Ratio 


MRKAdSgag 


470 


HCMV-Flgas-bGHpA [E3-] 


115 


HCMV-Hgag-SPA [E3+] 


320 


mCMV-ELgag-bGHpA DE3+] 


420 


Original construct * 


40-50 



5 

* This estimation is based on the clinical lot growth characteristics at Passage 12. 

EXAMPLE 13 

10 Analytical Evaluation of the enhanced Ad5 Constructs 

To study the effects of the transgene and the E3 gene on virus amplification, 
the enhanced adenoviral vector, MRK Ad5 HIV-1 gag, along with its transgene-less 
- --^5 >n (MRKpAdHVE3) and its E3- version (MRK Ad5 HTV-l gag E3-), was 
studied for several passages under serum-free conditions. Table 5A shows the 

15 amplification ratios determined for passages P3 to P8 for MRK Ad5 HTV-l gag. 
Within a certain MOI range, it has been determined that the virus output is directly 
proportional to the virus input. Therefore, the greater the number of virus particles 
per cell at infection, the greater the virus amount produced. Viral amplification ratios, 
on the other hand, are inversely proportional to the virus input. The lower the virus 

20 input, the greater the amplification ratio. 

Table 5B shows the amplification rates of the new E3+ vector backbone 
MRKpAdHVE3. It has a significantly lower rate of amplification compared with the 
gag transgene containing version. This may be contributed to the larger size MRK 
Ad5 HTV-l gag since it contains the transgene. This inclusion of the transgene brings 

25 the size of the adenoviras closer to the size of a wild type Ad5 vims. It is well known 
that adenoviruses amplify best when they are at close to their wild type genomic size. 
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Wild type Ad5 is 35,935 bp. The MRKpAdHVE3 is 32, 905 bp in length. The 
enhanced adenovector MRK AdS HTV-l gag is 35,453bp (See Figure 14 for vector 
map; see also Figure 15A-X show the complete pre-adenoviral vector sequence, 
which includes an additional 2,021 bp of the vector backbone).' 
5 Table 5C shows the ampUfication rates of the new E3- gag containing virus 

MRK AdS HTV-l gag E3-. Once again, this virus shows lower growth rate than the 
enhanced adenoviral vector. This may be attributed to the decreased sized of this 
virus (due to the E3 gene deletion) compared with wild type Ad5. The MRK AdS 
HIV-1 gag E3- virus is 32,810 bp in length. This can be compared with the wild type 
10 AdS which is 35,935 bp and MRK AdS fflV-l gag which is 35,453 bp in length. 
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Table 5A: Amplification ratios determined by AEX and QPA for MRKAdSgag over several 
continuous passaging in serum free media. Following P5, two replicate samples were taken (rep- 
1 and rep-2) and analyzed. 



MRKAdSgag repl 





Xv(1<f csSs/cr 
rnreclton 


1), Vlabaty(%) 
Harvosl 


Harvest Tbne 
h.pil. 


CeO Passage 
Number 


roer 
lO'^vprtnlculhtrB 


THor 

10*Vpte9B 


QPA 
ItyTCtDiartnl 


Ratio 


Ampttflcation 
Ratio 


AEX 
Inlsmal Conlro) 


P4 


1.49,61% 


0.58. 50% 


44 


46 


Bl7 




1.72 


SO 


470 
fMOr-125) 




P5 


1.38,93% 


a66,47% 


48 


49 


a? 


4.9 


1.38 


49 


170 


P6 


1.04, 94% 


0,68. 77% 


47 


48 


Sl8 


5.6 


1.42 


41 


200 


P7 


1.50,84% 


a96,61% 


49.5 


SO 




1.4 


0.97 


40 


SO 


P7 


1.09, 97% 


B9% 


50 


52 


5.2 


4.7 


1.70 


31 


170 


P8 


1.03. 94% 


0.86,64% 


47.5 


54 


9.0 


8.7 


1.10 


82 


310 


PO 


0.89, 95% 


a99, 73% 


47J 


56 


4.4 


4.9 


1.03 


43 


175 


3.12 
2.84 


P10 


1.09, 91% 


i.oe. 66% 


47.5 


58 


3.0 


Z6 


1.16 


26 


100 


2.70 
2.60 


PI I 


M9,ee% 


0.08.65% 


47 


60 


ae 


3.0 


1.15 


31 


110 


2.70 
2.70 


P12 


0.98, 91% 


0.85, 63% 


47.5 


47 


5.4 


5.5 


1.20 


45 


200 


2.66 
2.60 


P13 


1.00, SB% 


0.70. 67% 


49 


49 


&8 


6.8 


1.11 


62 


210 


3.16 
3.16 


P14 


1.94,92% 


0.88,67% 


46 


53 


8.6 


4.4 






160 


3.26 
3.27 


P16 


0.97. 86% 


0.64, 66% 


47 


47 


619 


7.1 






260 


3.12 
2.91 



Table 5B: Amplification ratios determined by AEX and QPA for MRKHVE3 over several 
continuous passaging in serum free media. MRKHVE3 is the new vector backbone whicli does 
NOT carry a transgene. 

MRKHVE3 





Xv(10*colls/rr 
Infection 


I), VlablDty C%) 
HarvBSt 


Harvosmmo 
h.p.L 


Cefl Passage 
Number 


Tiler 
tO'^vp/inl eutlure 


Tliar 
lO'vpteen 


QPA 
loT TCID^I 


Ralto 
AEXrQPA 


Amptiltcaion 
Ratio 


AEX 
Internal Control 


P4 


1.10,97% 


1.28,79% 


49 


64 


4.1 


3.6 


1.70 


25 


300 
fMOl = 125J 




PS 


0.92,69% 


1.18.77% 


47 


48 


4.3 


4.7 


1.24 


35 


170 


P6 


t.ss.ee% 


1.26.76% 


49.5 


50 


tJ2 


0.8 


0.S6 


21 


30 


P6 


1.09, 97% 


1.11.81% 


49 


52 


4.0 


3.6 


1.16 


34 


130 


P7 


1.17.91% 


1.22.91% 


47.5 


54 


3.7 


3.2 


0.S0 


74 


110 


P8 


0.98.68% 


1.41,83% 


48 


96 


2A 


2.1 


0.47 


45 


76 


3.12 
2.84 


pg 


1.20.69% 


1J26.B1% 


47.8 


58 


0.8 


0.7 


0.29 


28 


2S 


i70 
2.60 


P10 


0.99. 82% 


1.&S. 88% 


47 


60 


2.3 


2.3 


0>43 


53 


80 


2.70 
2.70 


P11 


1.07. 96% 


1^.83% 


48 


47 


2.7 


ZJ& 


0.41 


66 


90 


2.68 
2.60 


P12 


0.60.91% 


1.14, 60% 


49.5 


49 


5S 


7.4 


0.48 


123 


260 


3.18 
3.18 


P13 


1.96. 95% 


1.14.65% 


45.5 


S3 


aa 


3.0 






110 


3J!8 
3.27 


P14 


0.87, 66% 


1.03. 98% 


485 


47 


8.4 


9.7 






350 


3.12 
2.91 


PIS 


0.87, 99% 


0.G7, 69% 


49.5 


49 


5L3 


6.1 






216 


2.78 
2.52 
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Table 5C. Amplification ratios determined by AEX and QPA for MRKAd5gag(E3-) over several 
continuous passaging in serum free media. This construct Is Identical to the MRKAdSgag constmct 
except that this version is DELETED of the E3 gene. 

5 

MRKAd5gag(E3) 





Xv(10*oe»s/n 
Infection 


il), V1abinty(%) 
Harvest 


Harvest Time 


Cell Passago 
Number 


TItef 
10" vp/ni culture 


Tiler 
10* vpfoell 


QPA 
IO'TCIDm/itiI 


Ratio 
AEXrQPA 


AmpBtlcaUon 
RaUo 


AEX 
Internal Corbel 


PA 


1.62.77% 


1.12.62% 


47.S 


46 


2.0 


1.2 


0.92 


20 


100 
(M0I=12S) 




P5 


1.16.32% 


0.62. 43% 


49 


49 


3.3 


2J9 


0^9 


34 


100 


P6 


1.71» 86% 


0.20, 10% 


49 


SO 


4.7 


2.7 


1.70 


26 


100 


PB 


1.09,87% 


0.63, 54% 


49.5 


52 


5.4 


5.0 


1.76 


31 


180 


P7 


1.17.91% 


0.98. 72% 


47.50 


54 


7.1 


6.1 


0.67 


106 


220 


pa 


0.98,88% 


0.77, 48% 


48 


56 


3.1 


3J! 


0.66 


47 


115 


3.12 
2.64 


P9 


1.20,89% 


1.03,72% 


48 


58 


1.8 


1.5 


0.57 


32 


55 


2.70 
2.60 


P10 


a99.82% 


0.60, 62% 


46.6 


60 


3.2 


32 


0.68 


47 


115 


2.70 
2.70 


P11 


1.07,96% 


0.9B, 70% 


4B.5 


47 


5.9 


6.E 


0.68 


87 


200 


2.86 
2.60 


P12 


0.B0, 91% 


0.67, 59% 


50 


49 


5.1 


6.4 


0.72 


71 


230 


3.18 
3.18 


P13 


1.96.95% 


0.91.59% 


45.5 


S3 


7.4 


3.8 






135 


3.28 
3.27 


PH 


097. 96% 


0.81,74% 


48 


47 


6.8 


7.0 






250 


3.12 
2.91 


Pt5 


0.87. 99% 


0.84, 66% 


49 


49 


4.8 


5.5 






196 


2.78 
2JS2 



EXAMPLE 14 
Gag Expression Analysis of the Novel Constructs 
In vitro gag analysis of the MRK Ad5 HTV-l gag and the original HIV-gag 
10 vectors (research and clinical lot) show comparable gag expression. The clinical lot 
shows only a sUghtly reduced gag expression level. The most noticeable difference is 
with the mCMV vector. This vector shows roughly 3 fold lower expression levels 
compared with the other vectors tested (which all contain hCMV promoters). The 
mCMV-FLgag with bGHpA assay was performed three times using different 
15 propagation and purification lots and it consistently exhibited weaker gag expression. 

EXAMPLE 15 

Evaluation of MRK Ad5 HTV-l gag and Other gag-Containing 
. - Adenovectors in Balb/c Mice 
20 Cohorts of 10 balb/c mice were vaccinated intramuscularly with escalating 

doses of MRK Ad5 HTV-l gag, and the research and clinical lots of original 
Ad5HIV-lgag. Serum samples were collected 3 weeks post dose 1 and analyzed by 
anti-p24 sandwich ELISA. 
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Anti-p24 titers in mice that received MRK Ad5 fflV-l gag (107 and 109 
vp(viral particle) doses) were comparable (Figure 13) to those of the research lot of 
AdSHTV-l gag, for which much of the early rhesus data were generated on. These 
titers were also comparable when E3 is deleted (MRKAd5hCMVgagbGHpA(E3-)) or 
5 SPA is substituted for bGHpA terminator (MRKAd5 hCMV~gag-SPA (E3+)) or 
murine CMV promoter is used in place of hCMV (MRKAdS mCMV-gag-bGHpA 
(E3+)) in the MRKAdS backbone. 

The results shown in Table 7 indicate that the three other vectors ( in addition 
to the preferred vector, MRK Ad5 HIV-l gag, are also capable of inducing strong 
10 anti-gag antibody responses in mice. Interestingly enough, while the mCMV-FLgag 
construct containing bOHpA and E3+ in an El parallel orientation showed lowest gag 
expression in the COS cell in vitro infection (Table 6) in comparison with the other 
vectors tested, it generated the greatest anti-gag antibody response this in vivo Balb/c 
study. Table 7 also shows a dose response in anti-gag antibody production in both the 
15 research and the clinical lot. As expected, the clinical lot shows reduced anti-gag 
. antibody induction at each dosage level compared to the same dosage used for the 
research lot. 

Table 6: In vitro analysis for gag expression in COS cells by Elisa assay. 

20 



Viral Vectors" 


Jig gag/4.8xl0e5 COS/10e8 parts/48hr 


MRKAdSgag" 


1.40 


Qinical lot AdSgag' 


1.28 


Research lot AdSgag** 


1.32 


MCMVFL-gagbGHpA' 


0.42 



Aafionm absorbance readings taken for viral particle determinations. 
^ MRKAdSgag was produced in serum free conditions and purified at P5. 
' CUnical lot# AdSgagFNOOOl 
25 Research AdSFLgag lot# 6399 

* mCMVFL-gagbGHpA was produced in semm free conditions and purified at P5. 
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Table 7: mHlV020 Anti-p24 Ab Titers in Balb/c mice (n=10) vaccinated with various Arigag 
constructs and lots (3 week post dose1). 



Group 
ID 


Vaccine 


Dose 
(vp) 


GMT 


SE upper 


SE lower 


1 
2 


^MRKAdSgag 

» 


10^7 
10A9 


25600 
409600 


5877 
94028 


4780 
76473 


3 
4 


hCMV FL-oaa bGHoA FES-l 

II 


10^7 
10A9 


7352 
235253 


2077 
69767 


1620 
47659 


5 
6 


hCMV FL-gag SPA [E3+]-4 


10^7 
IC^S 


12800 
310419 


9905 
99181 


236 
75165 


7 
8 


''mCMV FL-gag bGHpA [E3+] -» 
It 


10^7 
10^9 


44572 
941014 


23504 
239068 


15389 
190836 


9 
10 


'^hCMV FL-gag bGHpA [E3-] <- 


10A7 
IQAQ 


3676 
117627 


934 
17491 


745 
15227 


11 
12 
13 
14 


research lot hCMV IntronA FL-gag bGHpA [E3-] <- 
It 

n 
II 


10A6 
10^7 
10^8 
10^9 


528 
14703 
58813 
204800 


262 
5274 
14942 
53232 


175 
3882 
11915 
42250 


15 
16 
17 
18 


clinical lot hCMVintronA FL>gag bGHpA [E3-] <- 

M 

a 

M 


10^ 
10^7 

icy^ 

10A9 


230 
4222 
19401 
89144 


82 
3405 
3939 
25187 


61 
1138 
3274 
19639 


19 


NaTve 


none 


93 


7 


6 



*2x50 hL i.m. (quad) injections/animal 
?.!.s: Youll, Chen, Casimlro 
Vaccination: T. Toner, Q. Sii 
Assay: M. Chen 



^The structure of MRKAdSgag is: hCMVFL-gagbOHpA [E3+] The same lot of MRKAdSgag 
used in this rodent study was used in the Rhesus monkey study (Tables 7 and 8). 
h"he same lot of mCMVFL-gagbGHpA[E3+] used In the in vitro study (Table 6) ws used here. 
*Thls construct was designed by Volker Sandig. It contains a shorter version of the hCMV 
promoter than that used In the MRK constructs. The adenovector backbone Is identical to the 
original backbone used In the original Adgag vector. Expression at 10e7 dose from this vector Is 7 
fold lower then the same dose of the MRKAd5gag and 4 fold lower than the research loL 



EXAMPLE 16 

Comparison of Humoral and Cellular Responses Towards the Original Ad-gag 
Construct with the New MRK Ad5 HTV-l gag in Rhesus Monkeys 
Cohorts of 3 rhesus monkeys were vaccinated intramuscularly with MRK AdS 
HIV-1 gag or the clinical AdSgag bulk at two doses, 10^^ vp and 10^ vp. 
Immunizations were conducted at week 0, 4, and 25. Serum and PBMC samples were 
collected at selected time points. The serum sample were assayed for anti-p24 Ab 
titers (using competitive based assay) and the PBMCs for antigen-specific IFN- 
gamma secretion following overnight stimulation with gag 20-mer peptide pool (via 
ElJSpot assay). 

The results shown in Table 8 indicate comparable responses with respect to 
the generation of anti-gag antibodies. The frequencies of gag-specific T cells in 
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peripheral blood assummarized in Table 9 demonstrate a strong cellular immune 
response generated after a single dose with the new construct MRK Ad5 HIV-1 gag. 
The responses are also boostable with second dose of the same vector. The vector is 
also able to induce CD8+ T cell responses (as evident by remaining spot counts after 
CD4+ depletion of PBMCs) which are responsible for cytotoxic activity. 

Table 8 Anti-p24 antibody titers (in mMU/mL) in rhesus macaques inununized with 



;ag-€xpressing adenovectors (Protocol HIV203). 



Vaccine 


PrG 


Wk4 


Wk8 


Wkl2 


Wk16 


Wk20 


Wk 25 


Wk28 


MRKAdSgog**. lO'^ll VD 


















97N010 


<10 


118 


5528 


11523 


7062 


21997 


ND 


51593 


97N116 


<10 


62 


772 


1447 


1562 


2174 


ND 


20029 


98X007 


<]0 


66 


3353 


6166 


6845 


3719 


ND 


24031 




















MRKAdSgoQ. 10''9VD 


















97m 20 


<10 


51 


204 


318 


366 


4B2 


ND 


6650 


97N144 


<1G 


18 


118 


274 


706 


888 


ND 


7136 


98X008 


<10 


15 


444 


386 


996 


1072 


ND 


12851 




















Ad5aaq^ Clinlcd Lot. lO'^l 1 VD 


















97X001 


<10 


87 


2579 


4718 


7174 


7250 


ND 


69226 


97N146 


<10 


72 


3604 


7380 


7526 


18906 


ND 


60283 


98X009 


<10 


78 


4183 


3946 


3124 


6956 


ND 


26226 
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Table 9. Number of gag-specific T cells per million peripheral blood mononuclear 
cells (PBMCs) in rhesus monkeys immunized with gag-expressing adenovectors. 
Also included are those frequencies in PBMCs depleted of CD4* T cells. 
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The adenovectors described herein and, particularly, MRK Ad5 HIV-1 gag, 
represent very promising HTV-gag adenovectors with respect to their enhanced 
growth characteristics in both serum and, more importantly, in serum-free media 

10 conditions. In comparison with the current HIV-1 gag adenovector construct, MRK 
Ad5 HIV-1 gag shows a 5-10 fold increased amplification rate. We have shown that 
it is genetically stable at passage 21. This construct is able to generate significant 
cellular immune responses in vivo even at a relatively low dose of 10'^9 vp. The 
potency of the MRKAdSgag construct is comparable to, if not better than the original 

15 inV-lgag vector as shown in this rhesus monkey study. 

EXAMPIJE17 

CODON OPTIMIZED HIV-1 POL AND CODON OPTIMZED 
mv-l POL MODIFICATIONS 
20 The open reading frames for the various synthetic pol genes disclosed herein 

comprise coding sequences for the reverse transcriptase (or RT which consists of a 
polymerase and RNase H activity) and integrase (IN). The protein sequence is based 
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on that of Hxb2r, a clonal isolate of lOB; this sequence has been shown to be closest 
to the consensus clade B sequence with only 16 nonidentical residues out of 848 
(Korber, et al., 1998, Human retroviruses and AIDS, Los Alamos National 
Laboratory, Los Alamos, New Mexico). The skilled artisan will understand after 
5 review of this specification that any available HIV-1 or HIV-2 strain provides a 
potential template for the generation of HTV pol DNA vaccine constructs disclosed 
herein. It is further noted that the protease gene is excluded from the DNA vaccine 
constructs of the present invention to insure safety from any residual protease activity 
in spite of mutational inactivation. The design of the gene sequences for both wild- 

10 type (wt-pol) and inactivated pol (lA-pol) incorporates the use of human preferred 
("humanized") codons for each amino acid residue in the sequence in order to 
maximize in vivo mammalian expression (Lathe, 1985, J. MoL Biol. 183:1-12). As 
can be discerned by inspecting the codon usage in SEQ ID NOs: 1, 3, 5 and 7, the 
following codon usage for mammalian optimization is prefeired: Met (ATG), Gly 

15 (GGC), Lys (AAG), Trp (TGG), Ser (TCC), Arg (AGG), Val (GTG), Pro (CCC), Thr 
(ACC), Glu (GAG); Leu (CTG), His (CAC), lie (ATC), Asn (AAC), Cys (TGC), Ala 
fGCC), Gin (CAG), Phe (TTC) and Tyr (TAG). For an additional discussion relating 
to manmialian (human) codon optimization, see WO 97/31115 (PCT/US97/02294), 
which, as noted elsewhere in this specification, is hereby incorporated by reference. It 

20 is intended that the skilled artisan may use alternative versions of codon optimization 
or may omit this step when generating HIV pol vaccine constracts within the scope of 
the present invention. Therefore, the present invention also relates to non-codon 
optimized versions of DNA molecules and associated recombinant adenoviral HIV 
vaccines which encode the various wild type and modified forms of the HIV Pol 

25 protein disclosed herein. However, codon optimization of these constracts is a 
preferred embodiment of this invention. 

A particular embodiment of this portion of the invention comprisies codon 
optimized nucleotide sequences which encode wt-pol DNA constructs (herein, "wt- 
pol" or "wt-pol (codon optimized))" wherein DNA sequences encoding the protease 

30 (PR) activity are deleted, leaving codon optimized "wild type" sequences which 

encode RT (reverse transcriptase and RNase H activity) and IN integrase activity. A 
DNA molecule which encodes this protein is disclosed herein as SEQ ID NO:l, the 
open reading frame being contained from an initiating Met residue at nucleotides 10- 
12 to a termination codon from nucleotides 2560-2562. SEQ ID N0:1 is as follows: 

35 AGATCTACCA TGGCCCCCAT CTCCCCCATT GAGACTGTGC CTGTGAAGCT GAAGCCTGGC 
ATGGATGGCC CCAAGGTGAA GCAGTGGCCC CTGACTGAGG AGAAGATCAA GGCCCTGGTG 
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GAAATCTGCA CTGAGATGGA 
TACAACACCC CTGTGTTTGC 
GACTTCAGGG AGCTGAACAA 
CACCCCGCTG GCCTGAAGAA 
TTCTCTGTGC CCCTGGATGA 
AACAATGAGA CCCCTGGCAT 
TCCCCTGCCA TCTTCCAGTC 
CCTGACATTG TGATCTACCA 
GGGCAGCACA GGACCAAGAT 
ACCCCTGACA AGAAGCACCA 
CCC6ACAAGT GGACTGTGCA 
GACATCCAGA AGCTGGTGGG 
GTGAGGCAGC TGTGCAAGCT 
ACTGAGGAGG CTGAGCTGGA 
GGGGTGTACT AT6ACCCCTC 
CAGTGGACCT ACCAAATCTA 
APGATGAGGG GGGCCCACAC 
ACCACTGAGT CCATTGTGAT 
GAGACCTGGG AGACCTGGTG 
TTTGTGAACA CCCCCCCCCT 
GGGGCTGAGA CCTTCTATGT 
GGCTATGTGA CCAACAGGGG. 
AAGACTGAGC TCCAGGCCAT 
GTGACTGACT CCCAGTATGC 
GAGCTGGTGA ACCAGATCAT 
GTGCCTGCCC ACAAGGGCAT 
ATCAGGAAGG TGCTGTTCCT 
CACTCCAACT G6AGGGCTAT 
ATTGTGGCCT CCTGTGACAA 
TGCTCCCCTG GCATCTGGCA 
GCTGTGCATG TGGCCTGCGG 
GAGACTGCCT ACTTCCTGCT 
GACAATGGCT CCAACTTCAC 
AAGCAGGAGT TTGGCATCCC 
AAGGAGCTGA AGAAGATCAT 
GTGCAGATGG CTGTGTTCAT 



GAAGGAGGGC AAAATCTCCA 
CATCAAGAAG AAGGACTCCA 
GAGGACCCAG GACTTCTGGG 
GAAGAAGTCT GTGACTGTGC 
GGACTTCAGG AAGTACACTG 
CAGGTACCAG TACAATGTGC 
CTCCATGACC AAGATCCTGG 
GTACATGGAT GACCTGTATG 
TGAGGAGCTG AGGCAGCACC 
GAAGGAGCCC CCCTTCCTGT 
GCCCATTGTG CTGCCTGAGA 
CAAGCTGAAC TGGGCCTCCC 
GCT6AGGGGC ACCAAGGCCC 
GCTGGCTGAG AACAGGGAGA 
CAAGGACCTG ATTGCTGAGA 
CCAGGAGCCC TTCAAGAACC 
CAATGATGTG AAGGAGCTGA 
CTGGGGCAAG ACCCCCAAGT 
GACTGAGTAC TGGCAGGCCA 
GGTGAAGCTG TGGTACCAGC 
GGATGGGGCT GCCAACAGGG 
CAGGCAGAAG GTGGTGACCC 
CTACCTGGCC CTCCAGGACT 
CCTGGGCATC ATCCAGGCCC 
TGAGGAGCTG ATCAA6AAGG 
TGGGGGCAAT GAGCAGGTGG 
GGATGGCATT GACAAGGCCC 
GGCCTCTGAC TTCAACCTGC 
GTGCCAGCTG AAGGGGGAGG 
GCTGGACTGC ACCCACCTGG 
CTACATTGAG GCTGAGGTGA 
GAAGCTGGCT GGCAGGTGGC 
TGGGGCCACA GTGAGGGCTG 
CTACAACCCC CAGTCCCAGG 
TGGGCAGGTG AGGGACCAGG 
CCACAACTTC AAGAGGAAGG 



AGATTGGCCC C6AGAACCCC 
CCAAGTGGAG GAAGCTGGT6 
AGGTGCAGCT GGGCATCCCC 
TGGATGTGGG GGATGCCTAC 
CCTTCACCAT CCCCTCCATC 
TGCCCCAGGG CTGGAAGGGC 
AGCCCTTCAG GAAGCAGAAC 
TGGGCTCTGA CCTGGAGATT 
TGCTGAGGTG GGGCCTGACC 
GGATGGGCTA TGAGCTGCAC 
AGGACTCCTG GACTGTGAAT 
AAATCTACCC TGGCATCAAG 
TGACTGAGGT GATCCCCCTG 
TCCTGAAGGA GCCTGTGCAT 
TCCAGAAGCA GGGCCAGGGC 
TGAAGACTGG CAAGTATGCC 
CTGAGGCTGT GCAGAAGATC 
TCAAGCTGCC CATCCAGAAG 
CCTGGATCCC TGAGTGGGAG 
TGGAGAAGGA GCCCATTGTG 
AGACCAAGCT GGGCAAGGCT 
TGACTGACAC CACCAACCAG 
CTGGCCTGGA GGTGAACATT 
AGCCTGATCA GTCTGAGTCT 
AGAAGGTGTA CCTGGCCTGG 
ACAAGCTGGT GTCTGCTGGC 
AGGATGAGCA TGAGAAGTAC 
CCCCTGTGGT GGCTAAGGAG 
CCATGCATGG GCAGGTGGAC 
AGGGCAAGGT GATCCTGGTG 
TCCCTGCTGA GACAGGCCAG 
CTGTGAAGAC CATCCACACT 
CCTGCTGGTG GGCTGGCATC 
GGGTGGTGGA GTCCATGAAC 
CTGAGCACCT GAAGACAGCT 
GGGGCATCGG GGGCTACTCC 
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GCTGGGGAGA GGATTGTGGA CATCATTGCC ACAGACATCC AGACCAAGGA GCTCCAGAAG 
CAGATCACCA AGATCCAGAA CTTCAGGGTG TACTACAGGG ACTCCAGGAA CCCCCTGTGG 

AAGGGCCCTG CCAAGCTGCT GTGGAAGGGG. GAGGGGGCTG TGGTGATCCA GGACAACTCT 
GACATCAAGG TGGTGCCCAG GAGGAAGGCC AAGATCATCA GGGACTATGG CAAGCAGATG 
5 GCTGGGGATG ACTGTGTGGC CTCCAGGCAG GATGAGGACT AAAGCCCGGG CAGATCT (SEQ 
ID N0:1) . 

The open reading fiaxn& of the wild type pol construct disclosed as SEQ ID 
N0:1 contains 850 amino acids, disclosed herein as SEQ ID N0:2, as follows: 





Met 


Ala 


Pro 


He 


Ser 


Pro 


He 


Glu 


Thr 


Val 


Pro 


Val 


Lys 


Leu 


Lys 


Pro 


10 


Gly 


Met 


Asp 


Gly 


Pro 


Lys 


Val 


Lys 


Gin 


Trp 


Pro 


Leu 


Thr 


Glu 


Glu 


Lys 




lie 


Lys 


Ala 


Leu 


Val 


Glu 


He 


Cys 


Thr 


Glu 


Met 


Glu 


Lys 


Glu 


Gly 


Lys 




lie 


Ser 


Lys 


He 


Gly 


Pro 


Glu 


Asn 


Pro 


Tyr 


Asn 


Thr 


Pro 


Val 


Phe 


Ala 




He 


Lys 


Lys 


Lys 


Asp 


Ser 


Thr 


Lys 


Trp 


Arg 


Lys 


Leu 


Val 


Asp 


Phe 


Arg 




Glu 


Leu 


Asn 


Lys 


Arg 


Thr 


Gin 


Asp 


Phe 


Trp 


Glu 


Val 


Gin 


Leu 


Gly 


He 


15 


Pro 


His 


Pro 


Ala 


Gly 


Leu 


Lys 


Lys 


Lys 


Lys 


Ser 


Val 


Thr 


Val 


Leu 


Asp 




Val 


Gly 


Asp 


Ala 


Tyr 


Phe 


Ser 


Val 


Pro 


Leu 


Asp 


Glu 


Asp 


Phe 


Arg 


Lys 




Tyr 


Thr 


Ala 


Phe 


Thr 


He 


Pro 


Ser 


He 


Asn 


Asn 


Glu 


Thr 


Pro 


Gly 


He 




Arg 


Tyr 


Gin 


Tyr 


Asn 


Val 


Leu 


Pro 


Gin 


Gly 


Trp 


Lys 


Gly 


Ser 


Pro 


Ala 




He 


Phe 


Gin 


Ser 


Ser 


Met 


Thr 


Lys 


He 


Leu 


Glu 


Pro 


Phe 


Arg 


Lys 


Gin 


20 


Asn 


Pro 


Asp 


He 


Val 


He 


Tyr 


Gin 


Tyr 


Met 


Asp 


Asp 


Leu 


Tyr 


Val 


Gly 




Ser 


Asp 


Leu 


Glu 


He 


Gly 


Gin 


His 


Arg 


Thr 


Lys 


He 


Glu 


Glu 


Leu 


Arg 




Gin 


His 


Leu 


Leu 


Arg 


Trp 


Gly 


Leu 


Thr 


Thr 


Pro 


Asp 


Lys 


Lys 


His 


Gin 




Lys 


Glu 


Pro 


Pro 


Phe 


Leu 


Trp 


Met 


Gly 


Tyr 


Glu 


Leu 


His 


Pro 


Asp 


Lys 




Trp 


Thr 


Val 


Gin 


Pro 


He 


Val 


Leu 


Pro 


Glu 


Lys 


Asp 


Ser 


Trp 


Thr 


Val 


25 


Asn 


Asp 


He 


Gin 


Lys 


Leu 


Val 


Gly 


Lys 


Leu 


Asn 


Trp 


Ala 


Ser 


Gin 


He 




Tyr 


Pro 


Gly 


He 


Lys 


Val 


Arg 


Gin 


Leu 


Cys 


Lys 


Leu 


Leu 


Arg 


Gly 


Thr 




Lys 


Ala 


Leu 


Thr 


Glu 


Val 


He 


Pro 


Leu 


Thr 


Glu 


Glu 


Ala 


Glu 


Leu 


Glu 




Leu 


Ala 


Glu 


Asn 


Arg 


Glu 


He 


Leu 


Lys 


Glu 


Pro 


Val 


His 


Gly 


Val 


Tyr 




Tyr 


Asp 


Pro 


Ser 


Lys 


Asp 


Leu 


He 


Ala 


Glu 


He 


Gin 


Lys 


Gin 


Gly 


Gin 


30 


Gly 


Gin 


Trp 


Thr 


Tyr 


Gin 


He 


Tyr 


Gin 


Glu 


Pro 


Phe 


Lys 


Asn 


Leu 


Lys 




Thr 


Gly 


Lys 


Tyr 


Ala 


Arg 


Met 


Arg 


Gly 


Ala 


His 


Thr 


Asn 


Asp 


Val 


Lys 




Gin 


Leu 


Thr 


Glu 


Ala 


Val 


Gin 


Lys 


He 


Thr 


Thr 


Glu 


Ser 


He 


Val 


He 




Trp 


Gly 


Lys 


Thr 


Pro 


Lys 


Phe 


Lys 


Leu 


Pro 


He 


Gin 


Lys 


Glu 


Thr 


Trp 




Glu 


Thr 


Trp 


Trp 


Thr 


Glu 


Tyr 


Trp 


Gin 


Ala 


Thr 


Trp 


He 


Pro 


Glu 


T3rp 


35 


Glu 


Phe 


Val 


Asn 


Thr 


Pro 


Pro 


Leu 


Val 


Lys 


Leu 


Trp 


Tyr 


Gin 


Leu 


Glu 




Lys 


Glu 


Pro 


He 


Val 


Gly 


Ala 


Glu 


Thr 


Phe 


Tyr 


Val 


Asp 


Gly 


Ala 


Ala 
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Asn 


Arg 


Glu 


Thr 


Lys 


Leu 


Gly 


Lys 


Ala 


Gly 


Tyr 


Val 


Thr 


Asn 


Arg 


Gly 


Arg 


Gin 


Lys 


Val 


Val 


Thr 


Leu 


Thr 


Asp 


Thr 


Thr 


Asn 


Gin 


Lys 


Thr 


Glu 


Leu 


Gin 


Ala 


He 


Tyr 


Leu 


Ala 


Leu 


Gin 


Asp 


Ser 


Gly 


Leu 


Glu 


Val 


Asn 


lie 


Val 


Thr 


Asp 


Ser 


Gin 


Tyr 


Ala 


Leu 


Gly 


He 


He 


Gin 


Ala 


Gin 


Pro 


Asp 


Gin 


Ser 


Glu 


Ser 


Glu 


Leu 


Val 


Asn 


Gin 


He 


He 


Glu 


Gin 


Leu 


He 


Lys 


Lys 


Glu 


Lys 


Val 


Tyr 


Leu 


Ala 


Trp 


Val 


Pro 


Ala 


His 


Lys 


Gly 


He 


Gly 


Gly 


Asn 


Glu 


Gin 


Val 


Asp 


Lys 


Leu 


Val 


Ser 


Ala 


Gly 


He 


Arg 


Lys 


Val 


Leu 


Phe 


Leu 


Asp 


Gly 


He 


Asp 


Lys 


Ala 


Gin 


Asp 


Glu 


His 


Glu 


Lys 


Tyr 


His 


Ser 


Asn 


Trp 


Arg 


Ala 


Met 


Ala 


Ser 


Asp 


Phe 


Asn 


Leu 


Pro 


Pro 


Val 


Val 


Ala 


Lys 


Glu 


He 


Val 


Ala 


Ser 


Cys 


Asp 


Lys 


Cys 


Gin 


Leu 


Lys 


Gly 


Glu 


Ala 


Met 


His 


Gly 


Gin 


Val 


Asp 


Cys 


Ser 


Pro 


Gly 


He 


Trp 


Gin 


Leu 


Asp 


Cys 


Thr 


His 


Leu 


Glu 


Gly 


Lys 


Val 


He 


Leu 


Val 


Ala 


Val 


His 


Val 


Ala 


Ser 


Gly 


Tyr 


He 


Glu 


Ala 


Glu 


Val 


He 


Pro 


Ala 


Glu 


Thr 


Gly 


Gin 


Glu 


Thr 


Ala 


Tyr 


Phe 


Leu 


Leu 


Lys 


Leu 


Ala 


Gly 


Arg 


Trp 


Pro 


Val 


Lys 


Thr 


He 


His 


Thr 


Asp 


Asn 


Gly 


Ser 


Asn 


Phe 


Thr 


Gly 


Ala 


Thr 


Val 


Arg 


Ala 


Ala 


Cys 


Trp 


Trp 


Ala 


Gly 


He 


Lys 


Gin 


Glu 


Phe 


Gly 


He 


Pro 


Tyr 


Asn 


Pro 


Gin 


Ser 


Gin 


Gly 


Val 


Val 


Glu 


Ser 


Met 


Asn 


Lys 


Glu 


Leu 


Lys 


Lys 


He 


He 


Gly 


Gin 


Val 


Arg 


Asp 


Gin 


Ala 


Glu 


His 


Leu 


Lys 


Thr 


Ala 


Val 


Gin 


Met 


Ala 


Val 


Phe 


He 


His 


Asn 


Phe 


Lys 


Arg 


Lys 


Gly 


Gly 


He 


Gly 


Gly 


Tyr 


Ser 


Ala 


Gly 


Glu 


Arg 


He 


Val 


Asp 


He 


He 


Ala 


Thr 


Asp 


He 


Gin 


Thr 


Lys 


Glu 


Leu 


Gin 


Lys 


Gin 


He 


Thr 


Lys 


He 


Gin 


Asn 


Fhe 


Arg 


Val 


Tyr 


Tyr 


Arg 


Asp 


Ser 


Arg 


Asn 


Pro 


Leu 


Trp 


Lys 


Gly 


Pro 


Ala 


Lys 


Leu 


Leu 


Trp 


Lys 


Gly 


Glu 


Gly 


Ala 


Val 


Val 


He 


Gin 


Asp 


Asn 


Ser 


Asp 


He 


Lys 


Val 


Val 


Pro 


Arg 


Arg 


Lys 


Ala 


Lys 


He 


He 


Arg 


Asp 


Tyr 


Gly 


Lys 


Gin 


Met 


Ala 


Gly 


Asp 


Asp 


Cys 


Val 


Ala 


Ser 


Arg 


Gin 


Asp 


Glu 


Asp 


(SEQ ID 


N0:2) . 























The present invention especially relates to an adenoviral vector vaccine which 
comprises a codon optimized HTV-l DNA pol construct wherein, in addition to 
deletion of the portion of the wild type sequence encoding the protease activity, a 

30 combination of active site residue mutations are introduced which are deleterious to 
HIV-1 pol (RT-RH-IN) activity of the expressed protein. Therefore, the present 
invention preferably relates to an adenoviral HTV-l DNA pol-based vaccine wherein 
the construct is devoid of DNA sequences encoding any PR activity, as well as 
containing a mutation(s) which at least partially, and preferably substantially, 

35 abolishes RT, RNase and/or IN activity. One type of HIV-1 pol mutant which is part 
and parcel of an adenoviral vector vaccine may include but is not limited to a mutated 
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DNA molecule comprising at least one nucleotide substitution which results in a point 
mutation which effectively alters an active site within the RT, RNase and/or IN 
regions of the expressed protein, resulting in at least substantially decreased 
enzymatic activity for the RT, RNase H and/or IN functions of HTV-l Pol. In a 
5 preferred embodiment of this portion of the invention, a HIV-1 DNA pol construct 
contains a mutation or mutations within the Pol coding region which effectively 
abolishes RT, RNase H and IN activity. An especially preferable HIV-1 DNA pol 
construct in a DNA molecule which contains at least one point mutation which alters 
the active site of the RT, RNase H and IN domains of Pol, such that each activity is at 

10 least substantially abolished. Such a HIV-1 Pol mutant will most likely comprise at 
least one point mutation in or around each catalytic domain responsible for RT, 
RNase H and IN activity, respectfully. To this end, an especially preferred HIV-1 
DNA pol construct is exemplified herein and contains nine codon substitution 
mutations which results in an inactivated Pol protein (lA Pol: SEQ ID N0:4, Figure 

15 17A-C) which has no PR, RT, RNase or IN activity, wherein three such point 

mutations reside within each of the RT, RNase and IN catalytic domains. Therefore, 
an especially preferred exemplification is an adenoviral vaccine which comprises, in 
an appropriate fashion, a DNA molecule which encodes lA-pol, which contains all 
nine mutations as shown below in Table 1. An additional preferred amino acid 

20 residue for substitution is Asp55 1, localized within the RNase domain of Pol. Any 
combination of the mutations disclosed herein may suitable and therefore may be 
utilized as an lA-Pol-based vaccine of the present invention. While addition and 
deletion mutations are contemplated and within the scope of the invention, the 
preferred mutation is a point mutation resulting in a substitution of the wild type 

25 anuno acid with an alternative amino acid residue. 



Table 1 



wt aa 


aa residue 


mutant aa 


enzvme function 


Asp 


112 


Ala 


RT 


Asp 


187 


Ala 


RT 


Asp 


188 


Ala 


RT 


Asp . 


445 


Ala 


RNase H 


Glu 


480 


Ala 


RNase H 


Asp 


500 


Ala 


RNase H 


Asp 


626 


Ala 


IN 


Asp 


678 


Ala 


IN 


Glu 


714 


Ala 


IN 
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It is preferred that point mutations be incorporated into the lApol mutant adenoviral 
vaccines of the present invention so as to lessen the possibility of altering epitopes in 
and around the active site(s) of KHV-l Pol. 

To this, end, SEQ K) N0:3 discloses the nucleotide sequence which codes for 
a codon optimized pol in addition to the nine mutations shown in Table 1, disclosed as 
follows, and referred to herein as "lApol": 



AGATCTACCA 


TGGCCCCCAT 


CTCCCCCATT 


GAGACTGTGC 


CTGTGAAGCT 


GAAGCCTGGC 


ATGGATGGCC 


CCAAGGTGAA 


GCAGTGGCCC 


CTGACTGAGG 


AGAAGATCAA 


GGCCCTGGTG 


GAAATCTGCA 


CTGAGATGGA 


GAAGGAGGGC 


AAAATCTCCA 


AGATTGGCCC 


CGAGAACCCC 


TACAACACCC 


CTGTGTTTGC 


CATCAAGAAG 


AAGGACTCCA 


CCAAGTGGAG 


GAAGCTGGTG 


GACTTCAGGG 


AGCTGAACAA 


6AGGACCCAG 


GACTTCTGGG 


AGGTGCAGCT 


GGGCATCCCC 


CACCCCGCTG 


GCCTGAAGAA 


GAAGAAGTCT 


GTGACTGTGC 


TGGCTGTGGG 


GGATGCCTAC 


TTCTCTGTGC 


CCCTGGATGA 


GGACTTCAGG 


AAGTACACTG 


CCTTGACCAT 


CCCCTCCATC 


AACAATGAGA 


CCCCTGGCAT 


CAGGTACCAG 


TACAATGTGC 


TGCCCCAGGG 


CTGGAAGGGC 


TCCCCTGCCA 


TCTTCCAGTC 


CTCCATGACC 


AAGATCCTGG 


AGCCCTTCAG 


GAAGCAGAAC 


CCTGACATTG 


TGATCTACCA 


GTACATGGCT 


GCCCTGTATG 


TGGGCTCTGA 


CCTGGAGATT 


-^GGCAGCACA 


GGACCAAGAT 


TGAGGAGCTG 


AGGCAGCACC 


TGCTGAGGTG 


GGGCCTGACC 


ACCCCTGACA 


AGAAGCACCA 


GAAGGAGCCC 


CCCTTCCTGT 


GGATGGGCTA 


TGAGCTGCAC 


CCC6ACAAGT 


GGACTGTGCA 


GCCCATTGTG 


CTGCCTGAGA 


AGGACTCCTG 


GACTGTGAAT 


GACATCCAGA 


AGCTGGTGGG 


CAAGCTGAAC 


TGGGCCTCCC 


AAATCTACCC 


TGGCATCAAG 


GTGAGGCAGC 


TGTGCAAGCT 


GCTGAGGGGC 


ACCAAGGCCC 


TGACTGAGGT 


GATCCCCCTG 


ACTGAGGAGG 


CTGAGCTGGA 


GCTGGCTGAG 


AACAGGGAGA 


TCCTGAAGGA 


GCCTGTGCAT 


GGGGTGTACT 


ATGACCCCTC 


CAAGGACCTG 


ATTGCTGAGA 


TCCAGAAGCA 


GGGCCAGGGC 


CAGTGGACCT 


ACCAAATCTA 


CCAGGAGCCC 


TTCAAGAACC 


TGAA6ACTGG 


CAAGTATGCC 


A6GATGAGGG 


GGGCCCACAC 


CAAT6ATGTG 


AAGCAGCTGA 


CTGAGGCTGT 


GCAGAAGATC 


ACCACTGAGT 


CCATTGTGAT 


CTGGGGCAAG 


ACCCCCAAGT 


TCAA6CTGCC 


CATCCAGAAG 


GAGACCTGGG 


AGACCTGGTG 


GACTGAGTAC 


TGGCAGGCCA 


CCTGGATCCC 


TGAGTGGGAG 


TTTGTGAACA 


CCCCCCCCCT 


GGTGAAGCTG 


TGGTACCAGC 


TGGAGAAGGA 


GCCCATTGTG 


GGGGCTGAGA 


CCTTCTAT6T 


GGCTGGGGCT 


GCCAACAGGG 


AGACCAAGCT 


GGGCAAGGCT 


GGCTATGTGA 


CCAACAGGGG 


CAGGCAGAAG 


GTGGTGACCC 


TGACTGACAC 


CACCAACCAG 


AAGACTGCCC 


TCCAGGCCAT- 


CTACCTGGCC 


CTCCAGGACT 


CTGGCCTGGA 


GGTGAACATT 


GTGACTGCCT 


CCCAGTATGC 


CCTGGGCATC 


ATCCAGGCCC 


AGCCTGATCA 


GTCTGAGTCT 


GAGCTGGTGA 


ACCAGATCAT 


TGAGGAGCTG 


ATCAAGAAGG 


AGAAGGTGTA 


CCTGGCCTGG 


GTGCCTGCCC 


ACAAGGGCAT 


TGGGGGCAAT 


GAGCAGGTGG 


ACAAGCTGGT 


GTCTGCTGGC 


ATCAGGAAGG 


TGCTGTTCCT 


GGATGGCATT 


GACAAGGCCC 


AGGATGAGCA 


TGAGAAGTAC 


CACTCCAACT 


GGAGGGCTAT 


GGCCTCTGAC 


TTCAACCTGC 


CCCCTGTGGT 


GGCTAAGGAG 
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ATTGTGGCCT CCTGTGACAA 
TGCTCCCCTG GCATCTGGCA 
GCTGTGCATG TGGCCTCCGG 
GAGACTGCCT ACTTCCTGCT 
5 GCCAATGGCT CCAACTTCAC 
AAGCAGGAGT TTGGCATCCC 
AAGGAGCTGA AGAAGATCAT 
GTGCAGATGG CTGTGTTCAT 
GCTGGGGAGA GGATTGTGGA 
10 CAGATCACCA AGATCCAGAA 
AAGGGCCCTG CCAAGCTGCT 
GACATCAAGG TGGTGCCCAG 
GCTGGGGATG ACT6TGTGGC 
N0:3). 

15 In order to produce the lA-pol-based adenoviral vaccines of the present 

invention, inactivation of the enzymatic functions was achieved by replacing a total of 
nine active site residues from the enzyme subunits with alanine side-chains. As 
shown in Table 1, all residues that comprise the catalytic triad of the polymerase, 
namely Aspll2, Aspl87, and Aspl88, were substituted with alanine (Ala) residues 

20 (Larder, et al., Ncaure 1987, 327: 716-717; Larder, et al., 1989, Proc. Natl Acad, Set 
1989, 86: 4803-4807). Three additional mutations were introduced at Asp445, 
GIu480 and AspSOO to abolish RNase H activity (Asp551 was left unchanged in this 
lA Pol construct), with each residue being substituted for an Ala residue, respectively 
(Davies, et al., 1991, Science 252:, 88-95; Schatz, et al., 1989, FEES Lett. 257: 311- 

25 314; Mizrahi, et al.. 1990, Nucl. Acids. Res. 18: pp. 5359-5353). HIV pol integrase 
function was abolished through three mutations at Asp626, Asp678 and Glu714. 
Again, each of these residues has been substituted with an Ala residue (Wiskerchen, 
et al., 1995. J. ViroL 69: 376-386; Leavitt. et al., 1993, J. Biol. Chem. 268: 2113- 
2119). Amino acid residue Pro3 of SEQ DD NO:4 marks the start of the RT gene. 

30 The complete amino acid sequence of L\-Pol is disclosed herein as SEQ ID N0:4 and 
Figure 17A-C, as follows: 

Met Ala Pro lie Ser Pro lie Glu Thr Val Pro Val Lys Leu Lys Pro 
Gly Met Asp Gly Pro Lys Val Lys Gin Trp Pro Leu Thr Glu Glu Lys 
lie Lys Ala Leu Val Glu lie Cys Thr Glu Met Glu Lys Glu Gly Lys 
35 lie Ser Lys lie Gly Pro Glu Asn Pro Tyr Asn Thr Pro Val Phe Ala 
lie Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys Leu Val Asp Phe Arg 



GTGCCAGCTG AAGGGGGAGG 
GCTGGCCTGC ACCCACCTGG 
CTACATTGAG GCTGAGGTGA 
GAAGCTGGCT GGCAGGTGGC 
TGGGGCCACA GTGAGGGCTG 
CTACAACCCC CAGTCCCAGG 
TGGGCAGGTG AGGGACCAGG 
CCACAACTTC AAGAGGAAGG 
CATCATTGCC. ACAGACATCC 
CTTCAGGGTG TACTACAGGG 
GTGGAAGGGG GAGGGGGCTG 
GAGGAAGGCC AAGATCATCA 
CTCCAGGCAG GATGAGGACT 



CCATGCATG6 GCAGGTGGAC 
AGGGCAAGGT GATCCTGGTG 
TCCCTGCTGA GACAGGCCAG 
CTGTGAAGAC CATCCACACT 
CCTGCTGGTG GGCTGGCATC 
GGGTGGTGGC CTCCATGAAC 
CTGAGCACCT GAAGACAGCT 
GGGGCATCGG GGGCTACTCC 
AGACCAAGGA GCTCCAGAAG 
ACTCCAGGAA CCCCCTGTGG 
TGGTGATCCA GGACAACTCT 
GGGACTATGG CAAGCAGATG 
AAAGCCCGGG CAGATCT (SEQ ID 
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.Glu Leu Asn Lys Arg Thr Gin Asp 
Pro His Pro Ala Gly Leu Lys Lys 
Val Gly Asp Ala Tyr Phe Ser Val 
Tyr Thr Ala Phe Thr lie Pro Ser 
5 Arg Tyr Gin Tyr Asn Val Leu Pro 
lie Phe Gin Ser Ser Met Thr Lys 
Asn Pro Asp lie Val He Tyr Gin 
Ser Asp Leu Glu He Gly Gin His 
Gin His Leu Leu Arg Trp Gly Leu 

10 Lys Glu Pro Pro Phe Leu Trp Met 
Trp Thr Val Gin Pro He Val Leu 
Asn Asp He Gin Lys Leu Val Gly 
Tyr Pro Gly He Lys Val Arg Gin 
Lys Ala Leu Thr Glu Val He Pro 

15 Leu Ala Glu Asn Arg Glu He Leu 
Tyr Asp Pro Ser Lys Asp Leu He 
'^ly Gin Trp Thr Tyr Gin He Tyr 
Thr Gly Lys Tyr Ala Arg Met Arg 
Gin Leu Thr Glu Ala Val Gin Lys 

20 Trp Gly Lys Thr Pro Lys Phe Lys 
Glu Thr Trp Trp Thr Glu Tyr Trp 
Glu Phe Val Asn Thr Pro Pro Leu 
Lys Glu Pro He Val Gly Ala Glu 
Asn Arg Glu Thr Lys Leu Gly Lys 

25 Arg Gin Lys Val Val Thr Leu Thr 
Leu Gin Ala He Tyr Leu Ala Leu 
He Val Thr Ala Ser Gin Tyr Ala 
Asp Gin Ser Glu Ser Glu Leu Val 
Lys Lys Glu Lys Val Tyr Leu Ala 

30 Gly Gly Asn Glu Gin Val Asp Lys 
Val Leu Phe Leu Asp Gly He Asp 
Tyr His Ser Asn Trp Arg Ala Met 
Val Val Ala Lys Glu He Val Ala 
Gly Glu Ala Met His Gly Gin Val 

35 Leu Ala Cys Thr His Leu Glu Gly 
Val Ala Ser Gly Tyr He Glu Ala 
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Phe 


Trp 


Glu 


Val 


Gin 


Leu 


Gly 


He 


Lys 


Lys 


Ser 


Val 


Thr 


Val 


Leu 


Ala 


Pro 


Leu 


Asp 


Glu 


Asp 


Phe 


Arg 


Lys 


He 


Asn 


Asn 


Glu 


Thr 


Pro 


Gly 


He 


Gin 


Gly 


Trp 


Lys 


Gly 


Ser 


Pro 


Ala 


He 


Leu 


Glu 


Pro 


Phe 


Arg 


Lys 


Gin 


Tyr 


Met 


Ala 


Ala 


Leu 


Tyr 


Val 


Gly 


Arg 


Thr 


Lys 


He 


Glu 


Glu 


Leu 


Arg 


Thr 


Thr 


Pro 


Asp 


Lys 


Lys 


His 


Gin 


Gly 


Tyr 


Glu 


Leu 


His 


Pro 


Asp 


Lys 


Pro 


Glu 


Lys 


Asp 


Ser 


Trp 


Thr 


Val 


Lys 


Leu 


Asn 


Trp 


Ala 


Ser 


Gin 


He 


Leu 


Cys 


Lys 


Leu 


Leu 


Arg 


Gly 


Thr 


Leu 


Thr 


Glu 


Glu 


Ala 


Glu 


Leu 


Glu 


Lys 


Glu 


Pro 


Val 


His 


Gly 


Val 


Tyr 


Ala 


Glu 


He 


Gin 


Lys 


Gin 


Gly 


Gin 


Gin 


Glu 


Pro 


Phe 


Lys 


Asn 


Leu 


Lys 


Gly 


Ala 


His 


Thr 


Asn 


Asp 


Val 


Lys 


He 


Thr 


Thr 


Glu 


Ser 


He 


Val 


He 


Leu 


Pro 


He 


Gin 


Lys 


Glu 


Thr 


Trp 


Gin 


Ala 


Thr 


Tirp 


He 


Pro 


Glu 


Trp 


Val 


Lys 


Leu 


Trp 


Tyr 


Gin 


Leu 


Glu 


Thr 


Phe 


Tyr 


Val 


Ala 


Gly 


Ala 


Ala 


Ala 


Gly 


Tyr 


Val 


Thr 


Asn 


Arg 


Gly 


Asp 


Thr 


Thr 


Asn 


Gin 


Lys 


Thr 


Ala 


Gin 


Asp 


Ser 


Gly 


Leu 


Glu 


Val 


Asn 


Leu 


Gly 


He 


He 


Gin 


Ala 


Gin 


Pro 


Asn 


Gin 


He 


He 


Glu 


Gin 


Leu 


He 


Trp 


Val 


Pro 


Ala 


His 


Lys 


Gly 


He 


Leu 


Val 


Ser 


Ala 


Gly 


He 


Arg 


Lys 


Lys 


Ala 


Gin 


Asp 


Glu 


His 


Glu 


Lys 


Ala 


Ser 


Asp 


Phe 


Asn 


Leu 


Pro 


Pro 


Ser 


Cys 


Asp 


Lys 


Cys 


Gin 


Leu 


Lys 


Asp 


Cys 


Ser 


Pro 


Gly 


He 


Trp 


Gin 


Lys 


Val 


He 


Leu 


Val 


Ala 


Val 


His 


Glu 


Val 


He 


Pro 


/Via 


Glu 


Thr 


Gly 
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Gin Glu Thr Ala Tyr Phe Leu Leu Lys Leu Ala Gly Arg Trp Pro Val 

Lys Thr lie His Thr Ala Asn Gly Ser Asn Phe Thr Gly Ala Thr Val 

Arg Ala Ala Cys Trp Trp Ala Gly He Lys Gin Glu Phe Gly He Pro 

Tyr Asn Pro Gin Ser Gin Gly Val Val Ala Ser Met Asn Lys Glu Leu 

5 Lys Lys He He Gly Gin Val Arg Asp Gin Ala Glu His Leu Lys Thr 

Ala Val Gin Met Ala Val Phe He His Asn Phe Lys Arg Lys Gly Gly 

He Gly Gly Tyr Ser Ala Gly Glu Arg He Val Asp He He Ala Thr 

Asp He Gin Thr Lys Glu Leu Gin Lys Gin He Thr. Lys He Gin Asn 

Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asn Pro Leu Trp Lys Gly Pro 

10 Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val He Gin Asp Asn 

Ser T^p He Lys Val Val Pro Arg Arg Lys Ala Lys He He Arg Asp 

Tyr Gly Lys Gin Met Ala Gly Asp Asp Cys Val Ala Ser Arg Gin Asp 
Glu Asp (SEQ ID N0:4) . 

As noted above, it will be understood that any combination of the mutations 

15 disclosed above may be suitable and therefore be utilized as an lA-pol-based 

adenoviral HIV vaccine of the present invention, either when administered alone or in 
a combined modality regime and/or a prime-boost regimen. For example, it may be 
possible to mutate only 2 of the 3 residues within the respective reverse transcriptase, 
RNase-H, and integrase coding regions while still abolishing these enzymatic 

20 activities. However, the lA-pol construct described above and disclosed as SEQ ID 
N0:3, as well as the expressed protein (SEQ ID N0:4;) is preferred. It is also 
preferred that at least one mutation be present in each of the three catalytic domains. 

Another aspect of this portion of the invention are codon optimized HTV-l 
Pol-based vaccine constructions which comprise a eukaryotic trafficking signal 

25 peptide such as from tPA (tissue-type plasminogen activator) or by a leader peptide 
such as is found in highly expressed mammalian proteins such as immunoglobulin 
leader peptides. Any functional leader peptide may be tested for efficacy. However, 
a preferred embodiment of the present invention, as with HTV-l Nef constructs shown 
herein, is to provide for a HTV-l Pol mutant adenoviral vaccine construction wherein 

30 the pol coding region or a portion thereof is operatively linked to a leader peptide, 
preferably a leader peptide from human tPA. In other words, a codon optimized 
HIV-1 Pol mutant such as lA-Pol (SEQ ID N0:4) may also comprise a leader peptide 
at the amino terminal portion of the protein, which may effect cellular trafficking and 
hence, immunogenicity of the expressed protein within the host cell. As noted in 

35 Figure 16A-B, a DNA vector which may be utilized to practice the present invention 
may be modified by known recombinant DNA methodology to contain a leader signal 
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peptide of interest, such that downstream cloning of the modified HIV-1 protein of 
interest results in a nucleotide sequence which encodes a modified HfV-l tPA/Pol 
protein. In the alternative, as noted above, insertion of a nucleotide sequence which 
encodes a leader peptide may be inserted into a DNA vector housing the open reading 
5 frame for the Pol protein of interest. Regardless of the cloning strategy, the end result 
is a polynucleotide vaccine which comprises vector components for effective gene 
expression in conjunction with nucleotide sequences which encode a modified HIV-1 
Pol protein of interest, including but not limited to a HIV-1 Pol protein which 
contains a leader peptide. The amino acid sequence of the human tPA leader utilized 

10 herein is as foUows: MDAMKRGLCCVLLLCGAVFVSPSEISS (SEQ ID NO:17). 
Therefore, another aspect of the present invention is to generate HIV-1 Pol-based 
vaccine constructions which comprise a eukaryotic trafficking signal peptide such as 
from tPA. To this end, the present invention relates to a DNA molecule which 
encodes a codon optinoized wt-pol DNA construct wherein the protease (PR) activity 

15 is deleted and a human tPA leader sequence is fused to the 5' end of the coding region. 
A DNA molecule which encodes this protein is disclosed herein as SEQ ID N0:5, the 
open reading frame disclosed herein as SEQ ID N0:6. 

To this end, the present invention relates to a DNA molecule which encodes a 
codon optimized wt-pol DNA construct wherein the protease (PR) activity is deleted 

20 and a human tPA leader sequence is fused to the 5' end of the coding region ( herein, 
"tPA-wt-pol"). A DNA molecule which encodes this protein is disclosed herein as 
SEQ ID NO:5, the open reading frame being contained from an initiating Met residue 
at nucleotides 8-10 to a termination codon from nucleotides 2633-2635. SEQ ID 
N0:5 is as follows: 



GATCACCATG 


GATGCAATGA 


AGAGAGGGCT 


CTGCTGTGTG 


CTGCTGCTGT 


GTGGAGCAGT 


CTTCGTTTCG 


CCCAGCGAGA 


TCTCCGCCCC 


CATCTCCCCC 


ATTGA6ACTG 


TGCCTGTGAA 


GCTGAAGCCT 


GGCATGGATG 


GCCCCAAGGT 


GAAGCAGTGG 


CCCCTGACTG 


AGGAGAAGAT 


CAAGGCCCTG 


GTGGAAATCT 


GCACTGAGAT 


GGAGAAGGAG 


GGCAAAATCT 


CCAAGATTGG 


CCCCGAGAAC 


CCCTACAACA 


CCCCTGTGTT 


TGCCATCAAG 


AAGAAGGACT 


CCACCAAGTG 


GAGGAAGCTG 


GTGGACTTCA 


GGGAGCTGAA 


CAAGAGGACC 


CAGGACTTCT 


GGGAGGTGCA 


GCTGGGCATC 


CCCCACCCCG 


CTGGCCTGAA 


GAAGAAGAAG 


TCTGTGACTG 


TGCTGGATGT 


GGGGGATGCC 


TACTTCTCTG 


TGCCCCTGGA 


TGAGGACTTC 


AGGAAGTACA 


CTGCCTTCAC 


CATCCCCTCC 


ATCAACAATG 


AGACCCCTGG 


CATCAGGTAC 


CAGTACAATG 


TGCTGCCCCA 


GGGCTGGAAG 


GGCTCCCCTG 


CCATCTTCCA 


GTCCTCCATG 


ACCAAGATCC 


TGGAGCCCTT 


CAGGAAGCAG 


AACCCTGACA 


TTGTGATCTA 


CCAGTACATG 


GATGACCTGT 


ATGTG66CTC 


TGACCTGGAG 


ATTGGGCAGC 


ACAGGACCAA 


GATT6AGGAG 


CTGAGGCAGC 


ACCTGCTGAG 
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GTGGGGCCTG ACCACCCCTG ACAAGAAGCA CCAGAAGGAG CCCCCCTTCC TGTGGATGGG 
CTATGAGCTG CACCCCGACA AGTGGACTGT GCAGCCCATT GTGCTGCCTG AGAAGGACTC 
CTGGACTGTG AATGACATCC AGAAGCTGGT GGGCAAGCTG AACTGGGCCT CCCAAATCTA 
CCCTGGCATC AAGGTGAGGC AGCTGTGCAA GCTGCTGAGG GGCACCAAGG CCCTGACTGA 
5 GGTGATCCCC CTGACTGAGG AGGCTGAGCT GGAGCTGGCT GAGAACAGGG AGATCCTGAA 
GGAGCCTGTG CATGGGGTGT ACTATGACCC CTCCAAGGAC CTGATTGCTG AGATCCAGAA 
GCAGGGCCAG GGCCAGTGGA CCTACCAAAT CTACCAGGAG CCCTTCAAGA ACCTGAAGAC 
TGGCAAGTAT GCCAGGATGA GGGGGGCCCA CACCAATGAT GTGAAGCAGC TGACTGAGGC 
TGTGCAGAAG ATCACCACTG AGTCCATTGT GATCTGGGGC AAGACCCCCA AGTTCAAGCT 

10 GCCCATCCAG AAGGAGACCT GGGAGACCTG GTGGACTGAG TACTGGCAGG CCACCTGGAT 
CCCTGAGTGG GAGTTTGTGA ACACCCCCCC CCTGGTGAAG CTGTGGTACC AGCTGGAGAA 
GGAGCCCATT GTGGGGGCTG AGACCTTCTA TGTGGATGGG GCTGCCAACA GGGAGACCAA 
GCTGGGCAAG GCTGGCTATG T6ACCAACAG GGGCAGGCAG AAGGTGGTGA CCCTGACTGA 
CACCACCAAC CAGAAGACTG AGCTCCAGGC CATCTACCTG GCCCTCCAGG ACTCTGGCCT 

15 GGAGGTGAAC ATTGTGACTG ACTCCCAGTA TGCCCTGGGC ATCATCCAGG CCCAGCCTGA 
TCAGTCTGAG TCTGAGCTGG TGAACCAGAT CATTGAGCAG CTGATCAAGA AGGAGAAGGT 
GTACCTGGCC TGGGTGCCTG CCCACAAGGG CATTGGGGGC AATGAGCAGG TGGACAAGCT 
GGTGTCTGCT GGCATCAGGA AGGTGCTGTT CCTGGATGGC ATTGACAAGG GCCAGGATGA 
GCATGAGAAG TACCACTCCA ACTGGAGGGC TATGGCCTCT GACTTCAACC TGCCCCCTGT 

20 GGTGGCTAAG GAGATTGTGG CCTCCTGTGA CAAGTGCCAG CTGAAGGGGG AGGCCATGCA 
TGGGCA6GTG GACTGCTCCC CTGGCATCTG GCAGCTGGAC TGCACCCACC TGGAGGGCAA 
GGTGATCCTG GTGGCTGTGC ATGTGGCCTC CGGCTACATT GAGGCTGAGG TGATCCCTGC 
TGAGACAGGC CAGGAGACTG CCTACTTCCT GCTGAAGCTG GCTGGCAGGT GGCCTGTGAA 
GACCATCCAC ACTGACAATG GCTCCAACTT CACTGGGGCC ACAGTGAGGG CTGCCTGCTG 

25 GTGGGCTGGC ATCAAGCAG6 AGTTTGGCAT CCCCTACAAC CCCCAGTCCC AGGGGGTGGT 
GGAGTCCATG AACAAGGAGC TGAAGAAGAT CATTGGGCAG GTGAGGGACC AGGCTGAGCA 
CCTGAAGACA GCTGTGCAGA TGGCTGTGTT CATCCACAAC TTCAAGAGGA AGGGGGGCAT 
CGGGGGCTAC TCCGCTGGGG AGAGGATTGT GGACATCATT GCCACAGACA TCCAGACCAA 
GGAGCTCCAG AAGCAGATCA CCAAGATCCA GAACTTCAGG GTGTACTACA GGGACTCCAG 

30 GAACCCCCTG TGGAAGGGCC CTGCCAAGCT GCTGTGGAAG GGGGAGGGGG CTGTGGTGAT 
CCAGGACAAC TCTGACATCA AGGTGGTGCC CAGGAGGAAG GCCAAGATCA TCAGGGACTA 
TGGCAAGCAG ATGGCTGGGG ATGACTGTGT GGCCTCCAGG CAGGATGAGG ACTAAAGCCC 
GGGCAGATCT {SEQ ID NO: 5). 

The open reading frame of the wild type tPA-pol construct disclosed as SEQ 
35 ID NO:5 contains 875 amino acids, disclosed herein as SEQ ID NO:6, as follows: 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
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Ala 


Val 


Phe 


Val 


Ser 


Pro 


Ser 


Glu 


He 


Ser 


Ala 


Pro 


He 


Ser 


Pro 


He 




Glu 


Thr 


Val 


Pro 


Val 


Lys 


Leu 


Lys 


Pro 


Gly 


Met 


Asp 


Gly 


Pro 


Lys 


Val 




Lys 


Gin 


Trp 


Pro 


Leu 


Thr 


Glu 


Glu 


Lys 


He 


Lys 


Ala 


Leu 


Val 


Glu 


He 




Cys 


Thr 


Glu 


Met 


Glu 


Lys 


Glu 


Gly 


Lys 


He 


Ser 


Lys 


He 


Gly 


Pro 


Glu 


5 


Asn 


Pro 


Tyr 


Asn 


Thr 


Pro 


Val 


Phe 


Ala 


He 


Lys 


Lys 


Lys 


Asp 


Ser 


Thr 




Lys 


Trp 


Arg 


Lys 


Leu 


Val 


Asp 


Phe 


Arg 


Glu 


Leu 


Asn 


Lys 


Arg 


Thr 


Gin 




Asp 


Phe 


Trp 


Glu 


Val 


Gin 


Leu 


Gly 


He 


Pro 


His 


Pro 


Ala 


Gly 


Leu 


Lys 




Lys 


Lys 


Lys 


Ser 


Val 


Thr 


Val 


Leu 


Asp 


Val 


Gly 


Asp 


Ala 


Tyr 


Phe 


Ser 




Val 


Pro 


Leu 


Asp 


Glu 


Asp 


Phe 


Arg 


Lys 


Tyr 


Thr 


Ala 


Phe 


Thr 


He 


Pro 


10 


Ser 


lie 


Asn 


Asn 


Glu 


Thr 


Pro 


Gly 


He 


Arg 


Tyr 


Gin 


Tyr 


Asn 


Val 


Leu 




Pro 


Gin 


Gly 


Trp 


Lys 


Gly 


Ser 


Pro 


Ala 


He 


Phe 


Gin 


Ser 


Ser 


Met 


Thr 




Lys 


lie 


Leu 


Glu 


Pro 


Phe 


Arg 


Lys 


Gin 


Asn 


Pro 


Asp 


He 


Val 


He 


Tyr 




Gin 


Tyr 


Met 


Asp 


Asp 


Leu 


Tyr 


Val 


Gly 


Ser 


Asp 


Leu 


Glu 


He 


Gly 


Gin 




His 


Arg 


Thr 


Lys 


He 


Glu 


Glu 


Leu 


Arg 


Gin 


His 


Leu 


Leu 


Arg 


Trp 


Gly 


15 


Leu 


Thr 


Thr 


Pro 


Asp 


Lys 


Lys 


His 


Gin 


Lys 


Glu 


Pro 


Pro 


Phe 


Leu 


Trp 




Met 


Gly 


Tyr 


Glu 


Leu 


His 


Pro 


Asp 


Lys 


Trp 


Thr 


Val 


Gin 


Pro 


He 


Val 




''»eu 


Pro 


Glu 


Lys 


Asp 


Ser 


Trp 


Thr 


Val 


Asn 


Asp 


He 


Gin 


Lys 


Leu 


Val 




Gly 


Lys 


Leu 


Asn 


Trp 


Ala 


Ser 


Gin 


He 


Tyr 


Pro 


Gly 


He 


Lys 


Val 


Arg 




Gin 


Leu 


Cys 


Lys 


Leu 


Leu 


Arg 


Gly 


Thr 


Lys 


Ala 


Leu 


Thr 


Glu 


Val 


He 


20 


Pro 


Leu 


Thr 


Glu 


Glu 


Ala 


Glu 


Leu 


Glu 


Leu 


Ala 


Glu 


Asn 


Arg 


Glu 


He 




Leu 


Lys 


Glu 


Pro 


Val 


His 


Gly 


Val 


Tyr 


Tyr 


Asp 


Pro 


Ser 


Lys 


Asp 


Leu 




lie 


Ala 


Glu 


lie 


Gin 


Lys 


Gin 


Gly 


Gin 


Gly 


Gin 


Trp 


Thr 


Tyr 


Gin 


He 




Tyr 


Gin 


Glu 


Pro 


Phe 


Lys 


Asn 


Leu 


Lys 


Thr 


Gly 


Lys 


Tyr 


Ala 


Arg 


Met 




Arg 


Gly 


Ala 


His 


Thr 


Asn 


Asp 


Val 


Lys 


Gin 


Leu 


Thr 


Glu 


Ala 


Val 


Gin 


25 


Lys 


lie 


Thr 


Thr 


Glu 


Ser 


He 


Val 


He 


Trp 


Gly 


Lys 


Thr 


Pro 


Lys 


Phe 




Lys 


Leu 


Pro 


He 


Gin 


Lys 


Glu 


Thr 


Trp 


Glu 


Thr 


Trp 


Trp 


Thr 


Glu 


Tyr 




Trp 


Gin 


Ala 


Thr 


Trp 


He 


Pro 


Glu 


Trp 


Glu 


Phe 


Val 


Asn 


Thr 


Pro 


Pro 




Leu 


Val 


Lys 


Leu 


Trp 


Tyr 


Gin 


Leu 


Glu 


Lys 


Glu 


Pro 


He 


Val 


Gly 


Ala 




Glu 


Thr 


Phe 


Tyr 


Val 


Asp 


Gly 


Ala 


Ala 


Asn 


Arg 


Glu 


Thr 


Lys 


Leu 


Gly 


30 


Lys 


Ala 


Gly 


Tyr 


Val 


Thr 


Asn 


Arg 


Gly 


Arg 


Gin 


Lys 


Val 


Val 


Thr 


Leu 




Thr 


Asp 


Thr 


Thr 


Asn 


Gin 


Lys 


Thr. 


.Glu. 


Leu 


Gin 


Ala 


He 


Tyr 


Leu 


Ala 




Leu 


Gin 


Asp 


Ser 


Gly 


Leu 


Glu 


Val 


Asn 


He 


Val 


Thr 


Asp 


Ser 


Gin 


Tyr 




Ala 


Leu 


Gly 


lie 


He 


Gin 


Ala" 


Gin 


Pro 


Asp 


Gin 


Ser 


Glu 


Ser 


Glu 


Leu 




Val 


Asn 


Gin 


He 


He 


Glu 


Gin 


Leu 


He 


Lys 


Lys 


Glu 


Lys 


Val 


Tyr 


Leu 


35 


Ala 


Trp 


Val 


Pro 


Ala 


His 


Lys 


Gly 


He 


Gly 


Gly 


Asn 


Glu 


Gin 


Val 


Asp 




Lys 


Leu 


Val 


Ser 


Ala 


Gly 


He Arg 


Lys 


Val 


Leu 


Phe 


Leu 


Asp 


Gly 


He 
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Asp Lys Ala 61n Asp GIu His Glu Lys Tyr His Ser Asn Trp Arg Ala 
Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu lie Val 
Ala Ser Cys Asp Lys Cys Gin Leu Lys Gly Glu Ala Met His Gly Gin 
Val Asp Cys Ser Pro Gly lie Trp Gin Leu Asp Cys Thr His Leu Glu 
5 Gly Lys Val lie Leu Val Ala Val His Val Ala Ser Gly Tyr lie Glu 
Ala Glu Val lie Pro Ala Glu Thr Gly Gin Glu Thr Ala Tyr Phe Leu 
Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr lie His Thr Asp Asn 
Gly Ser Asn Phe Thr Gly Ala Thr Val Arg Ala Ala Cys Trp Trp Ala 
Gly lie Lys Gin Glu Phe Gly lie Pro Tyr Asn Pro Gin Ser Gin Gly 

10 Val Val Glu Ser Met Asn Lys Glu Leu Lys Lys lie He Gly Gin Val 
Arg Asp Gin Ala Glu His Leu Lys Thr Ala Val Gin Met Ala Val Phe 
He His Asn Phe Lys Arg Lys Gly Gly He Gly Gly Tyr Ser Ala Gly 
Glu Arg He Val Asp He He Ala Thr Asp He Gin Thr Lys Glu Leu 
Gin Lys Gin He Thr Lys He Gin Asn Phe Arg Val Tyr Tyr Arg Asp 

15 Ser Arg Asn Pro Leu Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 
Glu Gly Ala Val Val He Gin Asp Asn Ser Asp He Lys Val Val Pro 
Arg Arg Lys Ala Lys He He Arg Asp Tyr Gly Lys Gin Met Ala Gly 
Asp Asp Cys Val Ala Ser Arg Gin Asp Glu Asp (SEQ ID N0:6) . 

The present invention also relates to a codon optiniized HIV-1 Pol mutant 

20 contained within a recombinant adenoviral vector such as lA-Pol (SEQ ID N0:4) 
which comprises a leader peptide at the amino terminal portion of the protein, which 
may effect cellular trafficking and hence, immunogenicity of the expressed protein 
within the host cell. Any such adenoviral-based HTV-l DNA pol mutant disclosed in 
the above paragraphs is suitable for fusion downstream of a leader peptide, such as a 

25 leader peptide including but not limited to the human tPA leader sequence. Therefore, 
any such leader peptide-based HTV-l pol mutant construct may include but is not 
limited to a mutated DNA molecule which effectively alters the catalytic activity of 
the RT, RNase and/or IN region of the expressed protein, resulting in at least 
substantially decreased enzymatic activity one or more of the RT, RNase H and/or IN 

30 functions of HTV-l Pol. In a preferred embodiment of this portion of the invention, a 
leader peptide/HIV-1 DNA pol construct contains a mutation or mutations within the 
Pol coding region which effectively abolishes RT, RNase H and IN activity. An 
especially preferable HIV-1 DNA pol construct is a DNA molecule which contains at 
least one point mutation which alters the active site and catalytic activity within the 

35 RT, RNase H and IN domains of Pol, such that each activity is at least substantially 
abolished, and preferably totally abolished. Such a HIV-1 Pol mutant will most likely 
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comprise at least one point mutation in or around each catalytic domain responsible for 
RT, RNase H and IN activity, respectfully. An especially preferred embodiment of 
this portion of the invention relates to a human tPA leader fused to the lA-Pol protein 
comprising the nine mutations shown in Table 1 . The DNA molecule is disclosed 
herein as SEQ ID NO:7 and the expressed tPA-IA Pol protein comprises a fusion 
junction as shown in Figure 18. The complete amino acid sequence of the expressed 
protein is set forth in SEQ ID NO:8. To this end, SEQ ID N0:7 discloses the 
nucleotide sequence which codes for a human tPA leader fused to the lA Pol protein 
comprising the nine mutations shown in Table 1 (herein, "tPA-opt-IApor"). The open 
reading frame begins with the initiating Met (nucleotides 8-10) and terminates with a 
"TAA" codon at nucleotides 2633-2635. The nucleotide sequence encoding tPA- 
lAPol is also disclosed as follows: 

GATCACCATG GATGCAATGA AGAGAGG6CT CTGCTGTGTG CTGCTGCTGT GTGGAGCAGT 
CTTCGTTTCG CCCAGCGA6A TCTCCGCCCC CATCTCCCCC ATTGAGACTG TGCCTGTGAA 
GCTGAAGCCT GGCATGGATG GCCCCAAGGT GAAGCAGTGG CCCCTGACTG AGGAGAAGAT 
CAAGGCCCTG GTGGAAATCT GCACTGAGAT GGAGAAGGAG GGCAAAATCT CCAAGATTGG 
rCCCGAGAAC CCCTACAACA CCCCTGTGTT TGCCATCAAG AAGAAGGACT CCACCAAGTG 
GAGGAAGCTG GTGGACTTCA GGGAGCTGAA CAAGAGGACC CAGGACTTCT GGGAGGTGCA 
GCTGGGCATC CCCCACCCCG CTGGCCTGAA GAAGAAGAAG TCTGTGACTG TGCTGGCTGT 
GGGGGATGCC TACTTCTCTG TGCCCCTGGA TGAGGACTTC AGGAAGTACA CTGCCTTCAC 
CATCCCCTCC ATCAACAATG AGACCCCTGG CATCAGGTAC CAGTACAATG TGCTGCCCCA 
GGGCTGGAAG GGCTCCCCTG CCATCTTCCA GTCCTCCATG ACCAAGATCC TGGAGCCCTT 
CAGGAAGCAG AACCCTGACA TTGTGATCTA CCAGTACATG GCTGCCCTGT ATGTGGGCTC 
TGACCTGGAG ATTGGGCAGC ACAGGACCAA GATTGA6GAG CTGAGGCAGC ACCTGCTGAG 
GTGGGGCCTG ACCACCCCTG ACAAGAAGCA GGAGAAGGAG CCCCCCTTCC TGTGGATGG6 
CTATGAGCTG CACCCCGACA AGTGGACTGT GCAGCCCATT GTGCTGCCTG AGAAGGACTC 
CTGGACTGTG AATGACATCC AGAAGCTGGT G6GCAAGCTG AACT6GGCCT CCCAAATCTA 
CCCTGGCATC AAGGTGAGGC AGCTGTGCAA GCTGCTGAGG GGCACCAAGG CCCTGACTGA 
GGTGATCCCC CTGACTGAGG AGGCTGAGCT GGAGCTGGCT GAGAACAGGG AGATCCTGAA 
GGAGCCTGTG CATGGGGTGT ACTATGACCC CTCCAAGGAC CTGATTGCTG AGATCCAGAA 
♦GCAGGGCCAG GGCCAGTGGA CCTACCAAAT CTACCAGGAG CCCTTCAAGA ACCTGAAGAC 
TGGCAAGTAT GCCAGGATGA GGGGGGCCCA CACCAATGAT GTGAAGCAGC TGACTGAGGC 
TGTGCAGAAG ATCACCACTG AGTCCATTGT GATCTGGGGC AAGACCCCCA AGTTCAAGCT 
GCCCATCCAG AAGGAGACCT GGGAGACCTG GTGGACTGAG TACTGGCAGG CCACCTGGAT 
CCCTGAGTGG GAGTTTGTGA ACACCCCCCC CCTGGTGAAG CTGTGGTACC AGCTGGAGAA 
GGAGCCCATT GTGGGGCCTG AGACCTTCTA TGTGGCTGGG GCTGCCAACA GGGAGACCAA 
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GCTG6GCAAG 


GCTGGCTATG 


TGACCAACAG 


GGGCAGGCAG 


AAGGTGGTGA 


CCCT6ACTGA 


CACCACCAAC 


CAGAAGACTG 


CCCTCCAGGC 


CATCTACCTG 


GGCCTCCAGG 


ACTCTGGCCT 


GGAGGTGAAC 


ATTGTGACTG 


CCTCCCAGTA 


TGCCCTGGGC 


ATCATCCAGG 


CCCAGCCTGA 


TCAGTCTGAG 


TCTGAGCTGG 


TGAACCAGAT 


CATTGAGCAG 


CTGATCAAGA 


AGGAGAAGGT 


GTACCTGGCC 


TGGGTGCCTG 


CCCACAAGGG 


CATTGGGGGC 


AATGAGCAGG 


TGGACAAGCT 


GGTGTCTGCT 


GGCATCAGGA 


AGGTGCTGTT 


CCTGGATGGC 


ATTGACAAGG 


CCCAGGATGA 


GCATGAGAAG 


TACCACTCCA 


ACTGGAGGGC 


TATGGCCTCT 


GACTTCAACC 


TGCCCCCTGT 


GGTGGCTAAG 


GAGATTGTGG 


CCTCCTGTGA 


CAAGTGCCAG 


CTGAAGGGGG 


AGGCCATGCA 


TGGGCAGGTG 


GACTGCTCCC 


CTGGCATCTG 


GCAGCTGGCC 


TGCACCCACC 


TGGAGGGCAA 


GGTGATCCTG 


GTGGCTGTGC 


ATGTGGCCTC 


CGGCTACATT 


GAGGCTGAGG 


TGATCCCTGC 


TGAGACAGGC 


CAGGAGACTG 


CCTACTTCCT 


GCTGAAGCTG 


GCTGGCAGGT 


GGCCTGTGAA 


GACCATCCAC 


ACTGCCAATG 


GCTCCAACTT 


CACTGGGGCC 


ACAGTGAGG6 


CTGCCTGCTG 


GTGGGCTGGC 


ATCAAGCAGG 


AGTTTGGCAT 


CCCCTACAAC 


CCCCAGTCCC 


AGGGGGTGGT 


GGCCTCCATG 


AACAAGGAGC 


TGAAGAAGAT 


CATTGGGCAG 


GTGAGGGACC 


AGGCTGAGCA 


CCTGAAGACA 


GCTGTGCAGA 


TGGCTGTGTT 


CATCCACAAC 


TTCAAGAGGA 


AGGGGGGCAT 


CGGGGGCTAC 


TCCGCTGGGG 


AGAGGATTGT 


GGACATCATT 


GCCACAGACA 


TCCAGACCAA 


GGAGCTCCAG 


AAGCAGATCA 


CCAAGATCCA 


GAACTTCAGG 


GTGTACTACA 


GGGACTCCAG 


GAACCCCCTG 


TGGAAGGGCC 


CTGCCAAGCT 


GCTGTGGAAG 


GGGGAGGGGG 


CTGTGGTGAT 


CCAGGACAAC 


TCTGACATCA 


AGGTGGTGCC 


CAGGAGGAAG 


GCCAAGATCA 


TCAGGGACTA 


TGGCAAGCAG 


ATGGCTGGGG 


ATGACTGTGT 


GGCCTCCAGG 


CAGGATGAGG 


ACTAAAGCCC 


GGGCAGATCT 


(SEQ ID NO 


:7) . 









The open reading frame of the tPA-IA-pol construct disclosed as SEQ ID 



NO:7 contains 875 amino acids, disclosed herein as tPA-IA-Pol and SEQ ID N0:8, as 
follows: 



Met Asp Ala Met 


Lys 


Arg Gly 


Leu 


Cys 


Cys 


Val 


Leu 


Leu 


Leu 


Cys 


Gly 


Ala Val Phe Val 


Ser 


Pro 


Ser 


Glu 


lie 


Ser 


Ala 


Pro 


He 


Ser 


Pro 


He 


Glu Thr Val Pro 


Val 


Lys 


Leu 


Lys 


Pro 


Gly 


Met 


Asp 


Gly 


Pro 


Lys 


Val 


Lys Gin Trp Pro 


Leu 


Thr 


Glu 


Glu 


Lys 


He 


Lys 


Ala 


Leu 


Val 


Glu 


He 


Cys Thr Glu Met 


Glu 


Lys 


Glu 


Gly 


Lys 


He 


Ser 


Lys 


He 


Gly 


Pro 


Glu 


Asn Pro Tyr Asn 


Thr 


Pro 


Val 


Phe 


Ala 


He 


Lys 


Lys 


Lys 


Asp 


Ser 


Thr 


Lys Trp Arg Lys 


Leu 


Val 


Asp 


Phe 


Arg 


Glu 


Leu 


Asn 


Lys 


Arg 


Thr 


Gin 


Asp Phe Trp Glu 


Val 


Gin 


Leu 


Gly 


He 


Pro 


His 


Pro 


Ala 


Gly 


Leu 


Lys 


Lys Lys Lys Ser 


Val 


Thr 


Val 


Leu 


Ala 


Val 


Gly 


Asp 


Ala 


Tyr 


Phe 


Ser 


Val Pro Leu Asp 


Glu 


Asp 


Phe 


Arg 


Lys 


Tyr 


Thr 


Ala 


Phe 


Thr 


He 


Pro 


Ser lie Asn Asn 


Glu 


Thr 


Pro 


Gly 


He 


Arg 


Tyr 


Gin 


Tyr 


Asn 


Val 


Leu 


Pro Gin Gly Trp 


Lys 


Gly Ser 


Pro 


Ala 


He 


Phe 


Gin 


Ser 


Ser 


Met 


Thr 
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Lys lie Leu Glu Pro Phe Arg Lys Gin Asn Pro Asp lie Val lie Tyr 
Gin Tyr Met Ala Ala Leu Tyr Val Gly Ser Asp Leu Glu He Gly Gin 
His Arg Thr Lys He Glu Glu Leu Arg Gin His Leu Leu Arg Trp Gly 
Leu Thr Thr Pro Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu Trp 
5 Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gin Pro He Val 
Leu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp He Gin Lys Leu Val 
Gly Lys Leu Asn Trp Ala Ser Gin He Tyr Pro Gly He Lys Val Arg 
Gin Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val He 
Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu He 

10 Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 
He Ala Glu He Gin Lys Gin Gly Gin Gly Gin Trp Thr Tyr Gin He 
Tyr Gin Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met 
Arg Gly Ala His Thr Asn Asp Val Lys Gin Leu Thr Glu Ala Val Gin 
Lys He Thr Thr Glu Ser He Val He Trp Gly Lys Thr Pro Lys Phe 

15 Lys Leu Pro He Gin Lys Glu Thr Trp Glu Thr Trp Trp Thr Glu Tyr 
Trp Gin Ala Thr Trp He Pro Glu Trp Glu Phe Val Asn Thr Pro Pro 
Leu Val Lys Leu Trp Tyr Gin Leu Glu Lys Glu Pro He Val Gly Ala 
Glu Thr Phe Tyr Val Ala Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly 
Lys Ala Gly Tyr Val Thr Asn Arg Gly Arg Gin Lys Val Val Thr Leu 

20 Thr Asp Thr Thr Asn Gin Lys Thr Ala Leu Gin Ala He Tyr Leu Ala 
Leu Gin Asp Ser Gly Leu Glu Val Asn He Val Thr Ala Ser Gin Tyr 
Ala Leu Gly He He Gin Ala Gin Pro Asp Gin Ser Glu Ser Glu Leu 
Val Asn Gin He He Glu Gin Leu He Lys Lys Glu Lys Val Tyr Leu 
Ala Trp Val Pro Ala His Lys Gly He Gly Gly Asn Glu Gin Val Asp 

25 Lys Leu Val Ser Ala Gly He Arg Lys Val Leu Phe Leu Asp Gly He 
Asp Lys Ala Gin Asp Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 
Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu He Val 
Ala Ser Cys Asp Lys Cys Gin Leu Lys Gly Glu Ala Met His Gly Gin 
Val Asp Cys Ser Pro Gly He Trp Gin Leu Ala Cys Thr His Leu Glu 

30 Gly Lys Val He Leu Val Ala Val His Val Ala Ser Gly Tyr He Glu 
Ala Glu Val He Pro Ala Glu Thr Gly Gin Glu Thr Ala Tyr Phe. Leu 
Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr He His Thr Ala Asn 
Gly Ser Asn Phe Thr Gly Ala Thr Val Arg Ala Ala Cys Trp Trp Ala 
Gly He Lys Gin Glu Phe Gly He Pro Tyr Asn Pro Gin Ser Gin Gly 

35 Val Val Ala Ser Met Asn Lys Glu Leu Lys Lys He He Gly Gin Val 
Arg Asp Gin Ala Glu His Leu Lys Thr Ala Val Gin Met Ala Val Phe 
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lie His Asn Phe Lys Arg Lys Gly Gly lie Gly Gly Tyr Ser Ala Gly 
Glu Arg He Val Asp He He Ala Thr Asp He Gin Thr Lys Glu Leu 
Gin Lys Gin He Thr Lys He Gin Asn Phe Arg Val Tyr Tyr Arg Asp 
Ser Arg Asn Pro Leu Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 
5 Glu Gly Ala Val Val He Gin Asp Asn Ser Asp He Lys Val Val Pro 
Arg Arg Lys Ala Lys. He He Arg Asp Tyr Gly Lys Gin Met Ala Gly 
Asp Asp Cys Val Ala Ser Arg Gin Asp Glu Asp {SEQ ID NO: 8) . 

EXAMPLE 18 

10 CODON OPTIMIZED HIV-1 NEF AND CODON OPTIMIZBD 

HIV-1 NEF MODIFICATIONS 
Codon optimized version of HIV-1 Nef and HIV-1 Nef modifications are 
essentially as described in U.S. Application Serial No. 09/738,782, filed 
December 15, 2000 and PCT International Application PCT/USOO/34162, also filed 

15 December 15, 2000, both documents which are hereby incorporated by reference. As 
disclosed within the above-mentioned documents, particular embodiments of codon 
optimized Nef and Nef modifications relate to a DNA molecule encoding HTV-l Nef 
trom the HTV-l jfrl isolate wherein the codons are optimized for expression in a 
mammalian system such as a human. The DNA molecule which encodes this protein 

20 is disclosed herein as SEQ ID N0:9, while the expressed open reading frame is 
disclosed herein as SEQ ID NO: 10. Another embodiment of Nef-based coding 
regions for use in the adenoviral vectors of the present invention comprise a codon 
optimized DNA molecule encoding a protein containing the human plasminogen 
activator (tpa) leader peptide fused with the NHa-terminus of the HTV-l Nef - 

25 polypeptide. The DNA molecule which encodes this protein is disclosed herein as 
SEQ ID NO: 11, while the expressed open reading frame is disclosed herein as SEQ 
ID NO: 12. Another modified Nef optimized coding region relates to a DNA molecule 
encoding optimized HIV-1 Nef wherein the open reading frame codes for 
modifications at the amino terminal myristylation site (Gly-2 to Ala-2) and 

30 substitution of the Leu-174'-Leu-175 dileucine motif to Ala-174-AIa-175, herein 

described as opt nef (G2A, LLAA). The DNA molecule which encodes this protein is 
disclosed herein as SEQ ID NO: 13, while the expressed open reading frame is 
disclosed herein as SEQ ID NO: 14. An additional embodiment relates to a DNA 
molecule encoding optimized HTV-l Nef wherein the amino terminal myristylation 

35 site and dileucine motif have been deleted, as well as comprising a tPA leader peptide. 
This DNA molecule, opt tpanef (LLAA), comprises an open reading frame which 
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encodes a Nef protein containing a tPA leader sequence fused to amino add residue 
6-216 of HIV-l Nef (jfrl), wherein Leu-174 and Leu-175 are substituted with Ala-174 
and Ala-175, herein referred to as opt tpanef (LLAA) is disclosed herein as SEQ ID 
N0:15, while the expressed open reading frame is disclosed herein as SEQ ID NO: 16. 
5 As disclosed in the above-identified documents (U.S. Application Serial No. 

09/738,782 and PCX International Application PCT/USOO/34162) and reiterated 
herein, the following nef-based nucleotide and amino acid sequences which comprise 
the respective open reading frame are as follows: 

1. The nucleotide sequence of the codon optimized version of HIV-l jrfl 
10 nef gene is disclosed herein as SEQ ID N0:9, as shown herein: 



GATCTGCCAC 


CATGGGCGGC 


AAGTGGTCCA 


AGAGGTCCGT 


GCCCGGCTGG 


TCCACCGTGA 


GG6AGAGGAT 


GAGGAGGGCC 


GAGCCCGCCG 


CC6ACAGGGT 


GAGGAGGACC 


GAGCCCGCCG 


CC6TGGGCGT 


GGGCGCCGTG 


TCCAGGGACC 


TGGAGAAGCA 


CGGCGCCATC 


ACCTCCTCCA 


ACACCGCCGC 


CACCAACGCC 


GACTGCGCCT 


GGCTG6AGGC 


CCAGGAGGAC 


GAGGAGGTGG 


GCTTCCCCGT 


GAGGCCCCAG 


GTGCCCCTGA 


GGCCCATGAC 


CTACAAGGGC 


GCCGTGGACC 


TGTCCCACTT 


CCTGAAGGAG 


AAGGGCGGCC 


TGGAGGGCCT 


6ATCCACTCC 


CAGAAGAGGC 


\GGACATCCT 


GGACCTGTGG 


GTGTACCACA 


CCCAGGGCTA 


CTTCCCCGAC 


TGGCA6AACT 


ACAGCCCCGG 


CCCCGGCATC 


AGGTTCCCCC 


TGACCTTCGG 


CTGGTGCTTC 


AAGCTGGTGC 


CCGTGGAGCC 


CGAGAAGGTG 


GAGGAGGCCA 


ACGAGGGCGA 


GAAC7UVCTGC 


CTGCTGCACC 


CCATGTCCCA 


GCACGGCATC 


GAGGACCCCG 


AGAAGGAGGT 


GCTGGAGTGG 


AGGTTCGACT 


CCAAGCTGGC 


CTTCCACCAC 


GTGGCCAGGG 


AGCTGCACCC 


CGAGTACTAC 


AAGGACTGCT 


AAAGCCCGGG 


C ( SEQ ID NO : 9 ) . 









Preferred codon usage is as follows: Met (ATG), Gly (GGC), Lys (AAG), 
Trp (TGG), Ser (TCC), Arg (AGG), Val (GTG), Pro (CCC), Thr (ACC), Glu (GAG); 

25 Leu (CTG), His (CAC). lie (ATC), Asn (AAC), Cys (TGG), Ala (GCC), Gin (GAG), 
Phe (TTC) and Tyr (TAG). For an additional discussion relating to mammalian 
(human) codon optimization, see WO 97/31 115 (PCT/US97/02294), which is hereby 
incorporated by reference. See also Figure 19 A-B for a comparion of wild type vs. 
codon optimized nucleotides comprising the open reading frame of HTV-Nef . 

30 The open reading frame for SEQ ID NO: 9 above comprises an initiating 

methionine residue at nucleotides 12-14 and a *TAA" stop codon from nucleotides 
660-662. The open reading frame of SEQ ID N0:9 provides for a 216 amino acid 
HTV-l Nef protein expressed through utilization of a codon optimized DNA vaccine 
vector. The 216 amino acid HIV-l Nef (jfrl) protein is disclosed herein as SEQ ID 

35 NO: 10, and as follows: 

Met Gly Gly Lys Trp Ser Lys Arg Ser Val Pro Gly Trp Ser Thr Val 
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Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Arg Val Arg Arg 
Thr Glu Pro Ala Ala Val Gly Val Gly Ala Val Ser Arg Asp Leu Glu 
Lys His Gly Ala lie Thr Ser Ser Asn Thr Ala Ala Thr Asn Ala Asp 
Cys Ala Trp Leu Glu Ala Gin Glu Asp Glu Glu Val Gly Phe Pro Val 
5 Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Val Asp 
Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu lie His 
Ser Gin Lys Arg Gin Asp lie Leu Asp Leu Trp Val Tyr His Thr Gin 
Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly lie Arg 
Phe Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro 

10 Glu Lys Val Glu Glu Ala Asn Glu Gly Glu Asn Asn Cys Leu Leu His 
Pro Met Ser Gin His Gly lie Glu Asp Pro Glu Lys Glu Val Leu Glu 
Trp Arg Phe Asp Ser Lys Leu Ala Phe His His Val Ala Arg Glu Leu 
His Pro Glu Tyr Tyr Lys Asp Cys (SEQ ID NO:10) . 

HTV-l Nef is a 216 amino acid cytosolic protein which associates with ±e 

15 inner surface of the host cell plasma membrane through myristylation of Gly-2 

(Franchini et al., 1986, Virology 155: 593-599). While not all possible Nef functions 
'"^ve been elucidated, it has become clear that correct trafficking of Nef to the inner 
plasma membrane promotes viral replication by altering the host intracellular 
environment to facilitate the early phase of the HIV-1 Hfe cycle and by increasing the 

20 infectivity of progeny viral particles. In one aspect of the invention regarding 
codon-optimized, protein-modified polypeptides, the nef-encoding region of the 
adenovirus vector of the present invention is modified to contain a nucleotide 
sequence which encodes a heterologous leader peptide such that the anaino terminal 
region of the expressed protein will contain the leader peptide. The diversity of 

25 function that typifies eukaryotic cells depends upon the structural differentiation of 
their membrane boundaries. To generate and maintain these structures, proteins must 
be transported from their site of synthesis in the endoplasmic reticulum to 
predetermined destinations throughout the cell. This requires that the trafficking 
proteins display sorting signals that are recognized by the molecular machinery 

30 responsible for route selection located at the access points to the main trafficking 
pathways. Sorting decisions for most proteins need to be made only once as they 
traverse their biosynthetic pathways since their final destination, the cellular location 
at which they perform their function, becomes their permanent residence. 
Maintenance of intracellular integrity depends in part on the selective sorting and 

35 accurate transport of proteins to their correct destinations. Defined sequence motifs 
exist in proteins which can act as 'address labels'. A number of sorting signals have 
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been found associated with the cytoplasmic domains of membrane proteins. An 
effective induction of CTL responses often required sustained, high level endogenous 
expression of an antigen. As membrane-association via myristylation is an essential 
requirement for most of Nef's function, mutants lacking myristylation, by glycine-to- 
5 alanine change, change of the dileucine motif and/or by substitution with a tpa leader 
sequence as described herein, will be functionally defective, and therefore will have 
improved safety profile compared to wild-type Nef for use as an HIV-1 vaccine 
component. 

In another embodiment of this portion of the invention, either the DNA vector 

10 or the HTV-l nef nucleotide sequence is modified to include the human tissue-specific 
plasminogen activator (tPA) leader. As shown in Figure 16A-B, a DNA vector may 
be modified by known recombinant DNA methodology to contain a leader signal 
peptide of interest, such that downstream cloning of the modified HIV-1 protein of 
interest results in a nucleotide sequence which encodes a modified HIV-1 tPA/Nef 

15 protein. In the alternative, as noted above, insertion of a nucleotide sequence which 
encodes a leader peptide may be inserted into a DNA vector housing the open reading 
*rame for the Nef protein of interest. Regardless of the cloning strategy, the end result 
is a polynucleotide vaccine which comprises vector components for effective gene 
expression in conjunction with nucleotide sequences which encode a modified HIV-1 

20 Nef protein of interest, including but not limited to a HTV-l Nef protein which 

contains a leader peptide. The amino acid sequence of the human tPA leader utilized 
herein is as follows: l^AMKRGLCCVLLLCGAVFVSPSEISS (SEQ ID NO: 17). 

It has been shown that myristylation of Gly-2 in conjunction with a dileucine 
motif in the carboxy region of the protein is essential for Nef-induced down 

25 regulation of CD4 (Aiken et al., 1994, Cell 76: 853-864) via endocytosis. It has also 
been shown that Nef expression promotes down regulation of MHCI (Schwartz et al., 
1996, Nature Medicine 2(3): 338-342) via endocytosis. The present invention relates 
in part to DNA vaccines which encode modified Nef proteins altered in trafficking 
and/or functional properties. The modifications introduced into the adenoviral vector 

30 HIV vaccines of the present invention include but are not limited to additions, 
deletions or substitutions to the nef open reading frame which results in the 
expression of a modified Nef protein which includes an amino terminal leader 
peptide, modification or deletion of the amino terminal myristylation site, and 
modification or deletion of the dileucine motif within the Nef protein and which alter 

35 function within the infected host cell. Therefore, a central theme of the DNA 

molecules and recombinant adenoviral HTV vaccines of the present invention is (1) 
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host administration and intracellular delivery of a codon optimized nef-based 
adenoviral HIV vaccine; (2) expression of a modified Nef protein which is 
immunogenic in terms of eliciting both CTL and Th responses; and, (3) inhibiting or 
at least altering known early viral functions of Nef which have been shown to 

5 promote HTV-l replication and load within an infected host. Therefore, the nef 

coding region may be altered, resulting in a DNA vaccine which expresses a modified 
Nef protein wherein the amino terminal Gly-2 myristylation residue is either deleted 
or modified to express alternate amino acid residues. Also, the nef coding region may 
be altered so as to result in a DNA vaccine which expresses a modified Nef protein 

10 wherein the dileucine motif is either deleted or modified to express alternate amino 
acid residues. In addition, the adenoviral vector HIV vaccines of the present 
invention also relate to an isolated DNA molecule, regardless of codon usage, which 
expresses a wild type or modified Nef protein as described herein, including but not 
limited to modified Nef proteins which comprise a deletion or substitution of Gly 2, a 

15 deletion or substitution of Leu 174 and Leu 175 and/or inclusion of a leader sequence. 
Therefore, specific Nef-based constructs fiirther include the following, as 
^"'^mnlification's and not limitations. For example, the present invention relates to an 
adenoviral vector vaccine which encodes modified forms of HTV-l, an open reading 
frame which encodes a Nef protein which comprises a tPA leader sequence fused to 

20 amino acid residue 6-216 of HTV-l Nef (jfrl) is referred to herein as opt tpanef The 
nucleotide sequence comprising the open reading frame of opt tpanef is disclosed 
herein as SEQ ID NO: 1 1, as shown below: 

CATGGATGCA ATGAAGAGAG GGCTCTGCTG TGTGCTGCTG 

TTCGCCCAGC GAGATCTCCT CCAAGAGGTC CGTGCCCGGC 
25 GATGAGGAGG GCCGAGCCCG CCGCCGACAG GGTGAGGAGG 

CGTGGGCGCC GTGTCCAGGG ACCTGGAGAA GCACGGCGCC 

CGCCACCAAC GCCGACTGCG CCTGGCTGGA GGCCCAGGAG 

CGTGAGGCCC CAGGTGCCCC TGAGGCCCAT GACCTACAAG 

CTTCCTGAAG GAGAAGGGCG GCCTGGAGGG CCTGATCCAC 
30 CCTGGACCTG TGGGTGTACC ACACCCAGGG CTACTTCCCC 

CGGCCCCGGC ATCAGGTTCC CCCTGACCTT CGGCTGGTGC 

GCCCGAGAAG GTGGAGGAGG CCAACGAGGG CGAGAACAAC 

CCAGCACGGC ATCGAGGACC CCGAGAAGGA GGTGCTGGAG 

GGCCTTCCAC CACGTGGCCA GGGAGCTGCA CCCCGAGTAC 
35 (SEQ ID NO: 11) . 

The open reading frame for SEQ ID NO: 11 comprises an initiating methionine 



CTGTGTGGAG 


CAGTCTTCGT 


TGGTCCACCG 


TGAGGGAGAG 


ACCGAGCCCG 


CCGCCGTGGG 


ATCACCTCCT 


CCAACACCGC 


GACGAGGAG6 


TGGGCTTCCC 


GGCGCCGTGG 


ACCTGTCCCA 


TCCCAGAAGA 


GGCAGGACAT 


GACTGGCAGA 


ACTACACCCC 


TTCAAGCTGG- 


TGCCCGTGGA 


TGCCTGCTGC 


ACCCCATGTC 


TGGAGGTTCG 


ACTCCAAGCT 


TACAAGGACT 


GCTAAAGCC 



76 



wo 02/22080 



PCT/USOl/28861 



residue at nucleotides 2-A and a *TAA" stop codon from nucleotides 713-715. The 
open reading frame of SEQ ID N0:3 provides for a 237 amino acid HTV-l Nef 
protein which comprises a tPA leader sequence fused to amino acids 6-216 of HIV-1 
Nef, including the dileucine motif at amino acid residues 174 and 175. This 237 
5 amino acid tPA/Nef (jfrl) fusion protein is disclosed herein as SEQ ID NO: 12, and is 





shown as follows: 




















Met 


Asp Ala Met 


Lys 


Arg 


Gly 


Leu Cys Cys Val 


Leu 


Leu 


Leu 


Cys Gly 




Ala 


Val Phe Val 


Ser 


Pro 


Ser 


Glu He Ser Ser 


Lys 


Arg 


Ser 


Val Pro 




Gly 


Trp Ser Thr 


Val 


Arg 


Glu 


Arg Met Arg Arg Ala Glu 


Pro 


Ala Ala 


10 


Asp 


Arg Val Arg 


Arg 


Thr 


Glu 


Pro Ala Ala Val 


Gly Val 


Gly 


Ala Val 




Ser 


Arg Asp Leu 


Glu 


Lys 


His 


Gly Ala He Thr 


Ser 


Ser 


Asn 


Thr Ala 




Ala 


Thr Asn Ala 


Asp 


Cys 


Ala 


Trp Leu Glu Ala 


Gin 


Glu 


Asp 


Glu Glu 




Val 


Gly Phe Pro 


Val 


Arg 


Pro 


Gin Val Pro Leu Arg 


Pro 


Met 


Thr Tyr 




Lys 


Gly Ala Val 


Asp 


Leu 


Ser 


His Phe Leu Lys 


Glu 


Lys 


Gly 


Gly Leu 


15 


Glu 


Gly Leu He 


His 


Ser 


Gin 


Lys Arg Gin Asp 


He 


Leu 


Asp 


Leu Trp 




Val 


Tyr His Thr 


Gin 


Gly 


Tyr 


Phe Pro Asp Trp 


Gin 


Asn 


Tyr 


Thr Pro 




^ly 


Pro Gly He 


Arg 


Phe 


Pro 


Leu Thr Phe Gly Trp Cys 


Phe 


Lys Leu 




Val 


Pro Val Glu 


Pro 


Glu 


Lys 


Val Glu Glu Ala 


Asn 


Glu 


Gly 


Glu Asn 




Asn 


Cys Leu Leu 


His 


Pro 


Met 


Ser Gin His Gly 


He 


Glu 


Asp 


Pro Glu 


20 


Lys 


Glu Val Leu 


Glu 


Trp 


Arg 


Phe Asp Ser Lys 


Leu 


Ala 


Phe 


His His 




Val 


Ala Arg Glu 


Leu 


His 


Pro 


Glu Tyr Tyr Lys 


Asp 


Cys 


(SEQ ID NO: 12) . 



Therefore, this exemplified Nef protein, Opt tPA-Nef, contains both a tPA leader 
sequence as well as deleting the myristylation site of Gly-2A DNA molecule encoding 
HIV-1 Nef from the HTV-l jfrl isolate wherein the codons are optimized for 

25 expression in a mammalian system such as a human. 

In another specific embodiment of the present invention, a DNA molecule is 
disclosed which encodes optimized HIV-1 Nef wherein the open reading frame of a 
recombinant adenoviral HIV vaccine encodes for modifications at the amino terminal 
myristylation site (Gly-2 to Ala-2) and substitution of the Leu-174-Leu-175 dileucine 

30 motif to Ala-174-AIa-175. This open reading frame is herein described as opt nef 
(G2A,LLAA) and is disclosed as SEQ ID NO: 13, which comprises an initiating 
methionine residue at nucleotides 12-14 and a 'TAA" stop codon from nucleotides 
660-662. The nucleotide sequence of this codon optimized version of HIV-1 jrfl nef 
gene with the above mentioned modifications is disclosed herein as SEQ ID NO: 13, 

35 as follows: 



77 



wo 02/22080 



PCTAJSOl/28861 



GATCTGCCAC 


CATGGCCGGC 


AAGTGGTCCA AGAGGTCCGT 


GCCCGGCTGG 


TCCACCGTGA 


G6GAGAG6AT 


GAGGAGGGCC 


6AGCCCGCCG 


CCGACAGGGT 


GAGGAGGACC 


GAGCCCGCCG 


CCGTGGGCGT 


GGGCGCCGTG 


TCCAGGGACC 


TGGAGAAGCA 


CGGCGCCATC 


ACCTCCTCCA 


ACACCGCCGC 


CACCAACGCC 


GACTGCGCCT 


GGCTGGAGGC 


CCAGGAGGAC 


GAGGAGGTGG 


GCTTCCCCGT 


GAGGCCCCAG 


GTGCCCCTGA GGCCCATGAC 


CTACAAGGGC 


GCCGTGGACC 


TGTCCCACTT 


CCTGAAGGAG 


AAGGGCGGCC 


TGGAGGGCCT 


GATCCACTCC 


CAGAAGAGGC 


AGGACATCCT 


GGACCTGTGG 


GTGTACCACA 


CCCAGGGCTA 


CTTCCCCGAC 


TGGCAGAACT 


ACACCCCCGG 


CCCCGGCATC 


AGGTTCCCCC 


TGACCTTCGG 


CTGGTGCTTC 


AAGCTGGTGC 


CCGTGGAGCC 


CGAGAAGGTG 


GAGGAGGCCA 


ACGAGGGCGA 


GAACAACTGC 


GCCGCCCACC 


CCATGTCCCA 


GCACGGCATC 


GAGGACCCCG 


AGAAGGAGGT 


GCTGGAGTGG 


AGGTTCGACT 


CCAAGCTGGC 


CTTCCACCAC 


GTGGCCAGGG 


AGCTGCACCC 


CGAGTACTAC 


AAGGACTGCT 


AAAGCCCGGG 


C (SEQ ID NO: 13) . 









The open reading frame of SEQ ID NO: 13 encodes Nef (G2A,LLAA), 
disclosed herein as SEQ ID NO: 14, as follows: 
Met Ala Gly Lys Trp Ser Lys Arg Ser Val Pro Gly Trp 
Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Arg 
'"hr Glu Pro Ala Ala Val Gly Val Gly Ala Val Ser Arg 
Lys His Gly Ala lie Thr Ser Ser Asn Thr Ala Ala Thr 
Cys Ala Trp Leu Glu Ala Gin Glu Asp Glu Glu Val Gly 
Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly 
Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly 
Ser Gin Lys Arg Gin Asp lie Leu Asp Leu Trp Val Tyr 
Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro 
Phe Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro 
Glu Lys Val Glu Glu Ala Asn Glu Gly Glu Asn Asn Cys 
Pro Met Ser Gin His Gly lie Glu Asp Pro Glu Lys Glu 
Trp Arg Phe Asp Ser Lys Leu Ala Phe His His Val Ala 
His Pro Glu Tyr Tyr Lys Asp Cys Ser (SEQ ID NO: 14) . 

An additional embodiment of the present invention relates to another DNA 
molecule encoding optimized HIV-1 Nef wherein the amino terminal myristylation 
site and dileucine motif have been deleted, as well as comprising a tPA leader peptide. 
This DNA molecule, opt tpanef (LLAA) comprises an open reading frame which 
encodes a Nef protein containing a tPA leader sequence fused to amino acid residue 
6-216 of HIV-1 Nef (jfrl), wherein Leu-174 and Leu-ITS are substituted with Ala-174 
and Ala-175 (Ala-195 and Ala-196 in this tPA-based fusion protein). The nucleotide 



Ser Thr Val 
Val Arg Arg 
Asp Leu Glu 
Asn Ala Asp 
Phe Pro Val 
Ala Val Asp 
Leu lie His 
His Thr Gin 
Gly lie Arg 
Val Glu Pro 
Ala Ala His 
Val Leu Glu 
Arg Glu Leu 
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sequence comprising the open reading j&rame of opt tpanef (LLAA) is disclosed heiein 
as SEQ ID N0:15, as shown below: 



CATGGATGCA 


ATGAAGAGA6 


GGCTCTGCTG 


TGTGCTGCTG 


CTGTGTG6A6 


CAGTCTTC6T 


TTCGCCCAGC 


GAGATCTCCT 


CCAAGAGGTC 


CGTGCCCGGC 


T6GTCCACCG 


T6AGGGAGAG 


GATGAG6AGG 


GCC6AGCCCG 


CCGCCGACAG 


GGTGAGGAGG 


ACCGAGCCCG 


CCGCCGTGGG 


CGTGGGCGCC 


GTGTCCAGGG 


ACCTGGAGAA 


GCACGGCGCC 


ATCACCTCCT 


CCAACACCGC 


CGCCACCAAC 


GCCGACTGCG 


CCTGGCTG6A 


GGCCCAGGAG 


GACGAGGAGG 


TGGGCTTCCC 


CGTGAGGCCC 


CAGGTGCCCC 


TGAGGCCCAT 


GACCTACAAG 


GGCGCCGTGG 


ACCTGTCCCA 


CTTCCTGAAG 


GAGAAGGGCG 


GCCTGGAGGG 


CCTGATCCAC 


TCCCAGAAGA 


GGCAGGACAT 


CCTGGACCTG 


TGGGTGTACC 


ACACCCAGGG 


CTACTTCCCC 


GACTGGCAGA 


ACTACACCCC 


CGGCCCCGGC 


ATCAGGTTCC 


CCCTGACCTT 


CGGCTGGTGC 


TTCAAGCTGG 


TGCCCGTGGA 


GCCCGAGAAG 


GTGGAGGAGG 


CCAACGAGGG 


CGAGAACAAC 


TGCGCCGCCC 


ACCCCATGTC 


CCAGCACGGC 


ATCGAGGACC 


CCGAGAAGGA 


GGTGCTGGAG 


TGGAGGTTCG 


ACTCCAAGCT 


GGCCTTCCAC 


CACGTGGCCA 


GGGAGCTGCA 


CCCCGAGTAC 


TACAAGGACT 


GCTAAAGCCC 



15 (SEQ ID NO: 15) . 

The open reading frame of SEQ ID N0:7 encoding tPA-Nef (LLAA), 
lj;.iOsed herein as SEQ ID NO: 16, is as follows: 



Met 


Asp 


Ala 


Met 


Lys 


Arg 


Gly 


Leu 


Cys 


Cys 


Val 


Leu 


Leu 


Leu 


Cys 


Gly 


Ala 


val 


Phe 


Val 


Ser 


Pro 


Ser 


Glu 


He 


Ser 


Ser 


Lys 


Arg 


Ser 


Val 


Pro 


Gly 


Trp 


Ser 


Thr 


Val 


Arg 


Glu 


Arg 


Met 


Arg 


Arg 


Ala 


Glu 


Pro 


Ala 


Ala 


Asp 


Arg 


Val 


Arg 


Arg 


Thr 


Glu 


Pro 


Ala 


Ala 


Val 


Gly 


Val 


Gly 


Ala 


Val 


Ser 


Arg 


Asp 


Leu 


Glu 


Lys 


His 


Gly 


Ala 


He 


Thr 


Ser 


Ser 


Asn 


Thr 


Ala 


Ala 


Thr 


Asn 


Ala 


Asp 


Cys 


Ala 


Trp 


Leu 


Glu 


Ala 


Gin 


Glu 


Asp 


Glu 


Glu 


Val 


Gly 


Phe 


Pro 


Val 


Arg 


Pro 


Gin 


Val 


Pro 


Leu 


Arg 


Pro 


Met 


Thr 


Tyr 


Dys 


Gly 


Ala 


Val 


Asp 


Leu 


Ser 


His 


Phe 


Leu 


Lys 


Glu 


Lys 


Gly 


Gly 


Leu 


Glu 


Gly 


Leu 


He 


His 


Ser 


Gin 


Lys 


Arg 


Gin 


Asp 


He 


Leu 


Asp 


Leu 


Trp 


Val 


Tyr 


His 


Thr 


Gin Gly 


Tyr 


Phe 


Pro 


Asp 


Trp 


Gin 


Asn 


Tyr 


Thr 


Pro 


Gly 


Pro 


Gly 


He 


Arg 


Phe 


Pro 


Leu 


Thr 


Phe 


Gly 


Trp 


Cys 


Phe 


Lys 


Leu 


Val 


Pro 


Val 


Glu 


Pro 


Glu 


Lys 


Val 


Glu 


Glu 


Ala 


Asn 


Glu 


Gly 


Glu 


Asn 


Asn 


Cys 


Ala 


Ala 


His 


Pro 


Met 


Ser 


Gin 


His 


Gly 


He 


Glu 


Asp 


Pro 


Glu 


Lys 


Glu 


Val 


Leu 


Glu Trp 


Arg 


Phe 


Asp 


Ser 


Lys 


Leu 


Ala 


Phe 


His 


His 


Val 


Ala 


Arg 


Glu 


Leu 


His 


Pro 


Glu 


Tyr 


Tyr 


Lys 


Asp 


Cys 


(SEQ ID 


N0:16) . 


An adenoviral 


vector of the present invention may comprise a DNA sequence, 



regardless of codon usage, which expresses a wild type or modified Nef protein as 
35 described herein, including but not limited to modified Nef proteins which comprise a 
deletion or substitution of Gly 2, a deletion of substitution of Leu 174 and Leu 175 
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and/or inclusion of a leader sequence. Therefore, partial or fully codon optimized 
DNA vaccine expression vector constructs are preferred since such constructs should 
result in increased host expression. However, it is within the scope of the present 
invention to utilize "non-codon optimized" versions of the constructs disclosed herein, 

5 especially modified versions of HTV Nef which are shown to promote a substantial 
cellular immune response subsequent to host administration. 

Figure 20A-C show nucleotide sequences at junctions between nef coding 
sequence and plasmid backbone of nef expression vectors VlJns/nef (Figure 20A), 
VI Jns/nef(G2A,LLAA) (Figure 20B), VI Jns/tpanef (Figure 20C) and 

10 VI Jns/tpanef(LLAA) (Figure 20C, also). 5' and 3' flanking sequences of codon 

optimized nef or codon optimized nef mutant genes are indicated by bold/italic letters; 
nef and nef mutant coding sequences are indicated by plain letters. Also indicated (as 
underlined) are the restriction endonuclease sites involved in construction of 
respective nef expression vectors. VI Jns/tpanef and VI Jns/tpanef(LLAA) have 

IS identical sequences at the junctions. 

Figure 21 shows a schematic presentation of nef and nef derivatives. Amino 
^cid residues involved in Nef derivatives are presented. Glycine 2 and Leucine 174 
and 175 are the sites involved in myristylation and dileucine motif, respectively. 

20 EXAMPLE 19 

MRKAdSPol Construction and Virus Rescue 
Cojistruction of vector: shuttle plasmid and pre-adenovirus plasmid - Key 
steps performed in the construction of the vectors, including the pre-adenovirus 
plasmid denoted MRKAdSpol, is depicted in Figure 22. Briefly, the adenoviral shuttle 

25 vector for the full-length inactivated HIV-1 pol gene is as follows. The vector 

MRKpdelEl(Pac/pIX/pack450)+CMVniin+BGHpA(str.) is a derivative of the shuttle 
vector used in the construction of the MRKAd5gag adenoviral pre-plasmid. The 
vector contains an expression cassette with the hCMV promoter (no intronA) and the 
bovine growth hormone polyadenylation signal. The expression unit has been 

30 inserted into the shuttle vector such that insertion of the gene of choice at a unique 
BgPB. site will ensure the direction of transcription of the transgene will be Ad5 El 
parallel when inserted into the MRKpAd5CEl-/E3+)CIal (or MRKpAdHVE3) pre- 
plasmid. The vector, similar to the original shuttle vector contains the Pad site, 
extension to the packaging signal region, and extension to the pIX gene. The synthetic 

35 full-length codon-optimized HEV-l pol gene was isolated directly from the plasmid 
pVl Jns-HIV-pol-inact(opt). Digestion of this plasmid with Bgl JL releases the pol 
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gene intact (comprising a codon optimized lA pol sequence as disclosed in SEQ ID 
NO:3). The pol fragment was gel purified and ligated into the 
MRKpdeffil(Pac/pIX/pack450)+CMVinin+BGHpA(str.) shuttte vector at the Bgia 
site. The clones were checked for the correct orientation of the gene by using 
5 restriction enzymes DraJU/Notl, A positive clone was isolated and named 

MRKpdel+hCMVmin+EL-pol+bGHpA(s). The genetic structure of this plasmid was 
verified by PGR, restriction enzyme and DNA sequencing. The pre-adenovirus 
plasmid was constructed as follows. Shuttle plasmid MRKpdel+hCMVniin+EL- 
pol+bGHpA(S) was digested with restriction enzymes Pad and BstllOl I (or its 

10 isoschizomer, BsfZAOl I) and then co-transformed into E, coli strain BJ5183 with 
linearized {Clal digested) adenoviral backbone plasmid, MRKpAd(El-/E3+)ClaL 
The resulting pre-plasmid originally named MRKpAd+hCMVmin+FL- 
pol+bGHpA(S)E3+ is now referred to as "pMRKAdSpol". The genetic stmcture of 
the resulting pMRKAdSpol was verified by PGR, restriction enzyme and DNA 

15 sequence analysis. The vectors were transformed into competent coli XL-1 Blue 
for preparative production. The recovered plasmid was verified by restriction enzyme 
digestion and DNA sequence analysis, and by expression of the pol transgene in 
transient transfection cell culture. The complete nucleotide sequence of this 
pMRKAdSHTV-lpoI adenoviral vector is shown in Figure 26 A- AO. 

20 Generation of research- grade recombinant adenovirus - The pre-adenovirus 

plasmid, pMRKAdSpol, was rescued as infectious virions in PER.G6® adherent 
monolayer cell culture. To rescue infectious virus, 12 fig of pMRKAdSpol was 
digested with restriction enzyme Pad (New England Biolabs) and 3.3 |xg was 
transfected per 6 cm dish of PER.G6® cells using the calcium phosphate co- 

25 precipitation technique (Gell Phect Transfection Kit, Amersham Pharmacia Biotech 
Inc.). Pad digestion releases the viral genome from plasmid sequences allowing viral 
replication to occur after entry into PER.G6®cells. Infected cells and media were 
harvested 6 -10 days post-transfection, after complete viral cytopathic effect (GPE) 
was observed. Infected cells and media were stored at < -60*^G. This pol containing 

30 recombinant adenovirus is referred to herein as "MRKAdSpol". This recombinant 
adenovirus expresses an inactivated HIV-1 Pol protein as shown in SEQ ID NO:6. 



EXAMPLE 20 
MRKAdSNef Gonstruction and Virus Rescue 
35 Construction of vector: shuttle placid and pre-adenovirus plasmid - Key 

steps performed in the construction of the vectors, including the pre-adenovirus 
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plasmid denoted MRKAdSnef, is depicted in Figure 23. Briefly, as shown in 
Example 19 above, the vector 

MElKpdelEl(Pac/pIX/pack450)4<:MVnun+BGHpA(str.) is the shuttle vector used in 
the construction of the MRKAdSgag adenoviral pre-plasmid. It has been modified to 

5 contain the Pad site, extension to the packaging signal region, and extension to the 
pIX gene. It contains an expression cassette with the hCMV promoter (no intronA) 
and the bovine growth hormone polyadenylation signal. The expression unit has been 
inserted into the shuttle vector such that insertion of the gene of choice at a unique 
Bgll 1 site will ensure the direction of transcription of the transgene will be Ad5 El 

10 parallel when inserted into the MRKpAd5(El-/E3+)Clal pre-plasmid. The synthetic 
full-length codon-optimized HIV-1 nef gene was isolated directly from the plasmid 
pVlJns/nef (G2A,LLAA). Digestion of this plasmid with Bglll releases the pol gene 
intact, which comprises the nucleotide sequence as disclosed in SEQ ID NO: 13. The 
nef fragment was gel purified and ligated into the 

15 MRKpdelEl+CMVmin+BGHpA(str.) shuttle vector at the Bgll 1 site. The clones 

were checked for correction orientation of the gene by using restriction enzyme Seal. 
\ positive clone was isolated and named MRKpdelElhCMVminFL-nefBGHpA(s). 
The genetic structure of this plasmid was verified by PGR, restriction enzyme and 
DNA sequencing. The pre-adenovirus plasmid was constructed as follows. Shuttle 

20 plasmid MRKpdelElhCMVminFI^nefBGHpA(s) was digested with restriction 

enzymes Pad and 5^^1107 1 (or its isoschizomer, 5^^2107 1) and then co-transformed 
into E. coli strain BJ5183 with linearized {Clal digested) adenoviral backbone 
plasmid, MRKpAd(El/E3+)Clal. The resulting pre-plasmid originally named 
MElKpdelElhCMVnainFL-nefBGHpA(s) is now referred to as "pMRKAdSnef The 

25 genetic structure of the resulting pMRKAdSnef was verified by PGR, restriction 

enzyme and DNA sequence analysis. The vectors were transformed into competent E. 
coli XL-1 Blue for preparative production* The recovered plasmid was verified by 
restriction enzyme digestion and DNA sequence analysis, and by expression of the 
nef transgene in transient transfection cell culture. The complete nucleotide sequence 

30 of this pMRKAd5HIV-lnef adenoviral vector is shown in Figure 27 A- AM. 

Generation of research- grade recombinant adenovirus - The pre-adenovirus 
plasmid, pMRKAdSnef, was rescued as infectious virions in PER.C6® adherent 
monolayer cell culture. To rescue infectious virus, 12 ^ig of pMRKAdnef was 
digested with restriction enzyme Pad (New England Biolabs) and 3.3 \x% was 

35 transfected per 6 cm dish of PER.C6® cells using the calcium phosphate co- 
precipitation technique (Cell Phect Transfection Kit, Amersham Pharmacia Biotech 
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Inc.). Pad digestion releases the viral genome from plasmid sequences allowing 
viral replication to occur after entry into PER.C6®cells. Infected cells and media were 
harvested 6 -10 days post-transfection, after complete viral cytopathic effect (CPE) 
was observed. Infected cells and media were stored at < -60^C. This nef containing 
5 recombinant adenovirus is now referred to as "MRKAdSnef *. 

EXAMPLE 21 

Construction of Murine CMV Promoter Containing Shuttle Vectors 
for Inactivated Pol and Nef/G2A,LLAA 

10 . The murine CMV (mCMV) was amplified from the plasmid pMH4 (supplied 

by Frank Graham, McMaster University) using the primer set: mCMV {Not I) 
Forward: S'-ATA AGA A TG CGG CCGC CA TAT ACT GAG TCA TTA GG-3' 
(SEQ ID NO: 20); mCMV (Bgl II)Reverse: 5'-AAG G AAGATCT A CCG ACG 
CTG GTC GCG CCT C-3' (SEQ ID N0:21). The underlined nucleotides represent 

15 the Not I and the Bgl H sites respectively for each primer. This PGR amplicon was 
used for the construction of the mCMV shuttle vector containing the transgene in the 
parallel orientation. The hCMV promoter was removed from the original shuttle 
vector (containing the hCMV-gag-bGHpA transgene in the El parallel orientation) by 
digestion with Not I and Bgl U. The mCMV promoter (Not UBgl U digested PGR 

20 product) was inserted into the shuttle vector in a directional manner. The shuttle 
vector was then digested with Bgl II and the gag reporter gene {Bgl II fragment) was 
re-inserted back into the shuttle vector. Several clones were screened for correct 
orientation of the reporter gene. For the construction of the mCMV-gag in the El 
antiparallel orientation, the mCMV promoter was amplified from the plasmid pMH4 

25 using the following primer set: mCMV {Asc I) Forward: 5 - ATA AGA A TG GCG 
CGCCAT ATA CTG AGT CAT TAG G (SEQ ID NO:22); mCMV {Bgl II) Reverse: 
5* AAG GAAGATCTA CCG ACG CTG GTC GCG CCT C (SEQ ID NO:23). The 
underlined nucleotides represent the Asc I and Bgl TL sites, respectively for each 
primer. The shuttle vector containing the hCMV-gag transgene in the El antiparallel 

30 orientation was digested with A^cl and Bgll 1 to remove the hCMV-gag portion of the 
transgene. The mCMV promoter (Aycl/5g/ll digested PCR product) was inserted 
into the shuttle vector in a directional manner. The vector wate then digested with 
BglW and the gag reporter gene {BglW fragment) was re-inserted. Several clones 
were screened for correct orientation of the reporter gene. For each of the full length 

35 lA pol and full length nef^G2A,LLAA genes, cloning was performed using the unique 
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Bgl n site within the mCMV-bGHpA shuttle vector. The pol and nef genes were 
excised firom their respective pVl Jns plasmids by Bgl U digestion. 

EXAMPLE 22 

5 Construction of mCMV Full Length Inactivated Pol and 

Full Length nef/G2A.LLAA Adenovectors 
Each of these transgenes of Example 21 were inserted into the modified 
shuttle vector in both the El parallel and El anti-parallel orientations. Pacl and 
BstZl 101 digestion of each shuttle vector was performed and each specific transgene 

10 fragment containing the flanking Ad5 sequences was isolated and co-transformed 
with Cla I digested MRKpAd5(E3+) or MRKpAd5(E3-) aderiovector plasmids via 
bacterial homologous recombination in BJ5183 E, coli cells. Recombinant pie- 
plasmid adenovectors containing the various transgenes in both the E3- and E3+ 
versions (and in the El parallel and El antiparallel orientations) were subsequently 

15 prepared in large scale following transformation into XH Blue E, coli cells and 
analyzed by restriction analysis and sequencing. 

EXAMPLE 23 
Construction of hCMV-tpa-nef (LLAA) Adenovector 

20 The tpa-nef gene was amplified out from GMP grade pVlJns-tpanef (LLAA) vector 
using the primer sets: Tpanef (BamHI) F 5 -ATT GGA TCC ATG GAT GCA ATG 
AAG AGA GGG (SEQ ID 24); Tpanef (BamHI) R 5'-ATA GGA TCC TTA GCA 
GTC CTT GTA GTA CTC G (SEQ ID NO:25). The resulting PGR product was 
digested with So^nHI, gel purified and cloned into the Bgl n site of MRKAdSCMV- 

25 bGHpA shuttle vector {Bgl n digested and calf intestinal phosphatase treated). 

Clones containing the tpanef (LLAA) gene (see SEQ ID NO: 15 for complet coding 
region) in the correct orientation with respect to the hCMV promoter were selected 
following Sea I digestion. The resulting MRKAdStpanef shuttle vector was digested 
with Pac I and Bst Zl 101 and cloned into the E3+ MRKAdS adenovector via bacterial 

30 homologous recombination techniques. 

EXAMPLE 24 
Immunogenicity of MRKAdSpol and MRKAdSnef Vaccine 
Materials and Methods - Rodent Immunization - Groups of N=10 B ALB/c 
35 mice were immunized i.m. with the following vectors: (1) MRKAd5hCMV-IApol 
(E3+) at either 10^7 vp and 10^9 vp; and (2) MRKAdShCMV-IApol 0E3-) at either 
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10^7 vp and 10^9 vp. At 7 weeks post dose, 5 of the 10 mice per cohort were boosted 
with the same vector and dose they initially received. At 3 weeks post the second 
does, sera and spleens were collected jBrom all the animals for RT EUS A and IFNg 
Elispot analyses, respectively. For all rodent immunizations, the Ad5 vectors were 
5 diluted in 5 mM Tris, 5% sucrose, 75 mM NaCl, 1 mM MgC12, 0.005% polysorbate 
80, pH 8.0. The total dose was injected to both quadricep muscles in 50 |liL aliquots 
using a 0.3-mL insulin syringe with 28-1/2G needles (Becton-Dickinson, Franklin 
Lakes, NJ). 

Groups of N=I0 C57/BL6 mice were immunized i.m. with the following 
10 vectors: (1) MRKAd5hCMV-nef(G2A,LLAA) (E3+) at either 10^7 vp and 10'^9 vp; 
(2) MRKAd5mCMV-nef(G2A,LLAA) (E3+) at either 10'^7 vp and 10^9 vp; and (3) 
MRKAd5mCMV-tpanef(LLAA) (E3+) at either 10'^7 vp and 10^9 vp. At 7 weeks 
post dose, 5 of the 10 mice per cohort were boosted with the same vector and dose 
they initially received. At 3 weeks post the second does, sera and spleens were 
15 collected from aU the animals for RT ELISA and IFNg ELIspot analyses, 
respectively. 

Non-human Primate immunization - Cohorts of 3 rhesus macaques (2-3 kg) 
were vaccinated with the following Ad vectors: (1) MRKAd5hCMV-IApol (E3+) at 
either 10'^9 vp and lO'^ll vp dose; and (2) MRKAdShCMV-IApol (E3<) at either 

20 10^9 vp and 10^1 1 vp; (3) MRKAd5hCMV-nef(G2A,LLAA) (E3+) at either 10^9 vp 
and 10^11 vp; and (4) MRKAd5mCMV-nef(G2A,LLAA) (E3+) at either 10^9 vp and * 
10^11 vp. The vaccine was administered to chemically restrained monkeys (10 
mg/kg ketamine) by needle injection of two 0.5 mL aliquots of the Ad vectors (in 5 
mM Tris, 5% sucrose, 75 mM NaCl, 1 mM MgCla, 0.005% polysorbate 80, pH 8.0) 

25 into both deltoid muscles. TTie animals were immunized twice at a 4 week interval 
(T=0, 4 weeks). 

Murine anti-RTandanti-nefEUSA - Anti-RT titers were obtained following 
standard secondary antibody-based ELISA. Maxisorp plates (NUNC, Rochester; NY) 
were coated by overnight incubation with 100 \xLofl\ig /mL HTV-l RT protein 

30 (Advanced Biotechnologies, Columbia, MD) in PBS. For anti-nef ELISA, 100 uL of 1 
ug/mL HTV-l nef (Advanced Biotechnologies, Columbia, MD) was used to coat the 
plates. The plates were washed with PBS/0.05% Tween 20 using Titertek MAP 
instrument (HunstviUe, AL) and incubated for 2 h with 200 jiL/well of blocking 
solution (PBS/0.05% tween/1% BSA). An initial serum dilution of 100-fold was 

35 performed followed by 4-fold serial dilution. 100-|jL aliquots of serially diluted 
samples were added per well and incubated for 2 h at room tempeiature. The plates 
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were washed and 100 pL of 1/1000-diluted HRP-rabbit anti-mouse IgG (ZYMED, 
San Francisco, CA) were added with 1 h incubation. The plates were washed 
thoroughly and soaked with 100 1,2-phenylenedianiine dihydrochloride/hydrogen 
peroxide (DAKO, Norway) solution for 15 min. The reaction was quenched by 

5 adding 100 of 0.5M H2SO4 per well. OD492 readings were recorded using Titertek 
Multiskan MCC/340 with S20 stacker. Endpoint titers were defined as the highest 
serum dilution that resulted in an absorbance value of greater than or equal to 0.1 
OD492 (2.5 times the background value). 

Non-human primate and murine ELIspot assays - The enzyme-linked 

10 immuno-spot (ELJSpot) assay was utilized to enumerate antigen-specific INPy- 
secreting cells from mouse spleens (Miyahira, et al.l995, 7. Immunol Methods 
181:45-54) or macaque PBMCs. Mouse spleens were pooled from 5 mice/cohort and 
single cell suspensions were prepared at 5xlO^/mL in complete RPMI media 
(RPNai640, 10% FBS, 2mM L-glutamine. lOOU/mL Penicillin, 100 u/mL 

15 streptomycin, 10 mM Hepes, 50 uM P-ME). Rhesus PBMCs were prepared from 8- 
15 mL of heparinized blood following standard FicoU gradient separation (Coligan, et 
"1, 1998, Current Protocols in Immunology, John Wiley & Sons, Inc.). Multiscreen 
opaque plates (Millipore, France) were coated with 100 jiUwell of either 5 pig/mL 
purified rat anti-mouse IFN-y IgGl, clone R4-6A2 (Pharmingen, San Diego, CA), or 

20 15 ug/mL mouse anti-human IFN-y IgGza (Cat. No. 1598-00, R&D Systems, 
Minneapohs, MN) in PBS at 4°C overnight for murine or monkey assays, 
respectively. The plates were washed with PBS/penicillin/streptomycin and blocked 
with 200 jAiy well of complete RPMI media for 37 °C for at least 2 h. 

To each well, 50 of cell samples (4-5x10^ cells per well) and 50 pL of the 

25 antigen solution were added. To the control well, 50 fiL of the media containing 

DMSO were added; for specific responses, either selected peptides or peptide pools (4 
ug/mL per peptide final concentration) were added. For B ALB/c mice immunized 
with the pol constructs, stimulation was conducted using a pool of CD4^-epitope 
containing 20-mer peptides (aa21-40, aa411-430, aa641-.660, aa731-750, aa771-790) 

30 or a pool of CD8'^-epitope containing peptides (aa201-220, aa3 11-330, aa78 1-800). 
For C57/BL6 mice immunized with the nef construct, either aa51-70 (CDS'** T cell 
epitope) or aa8 1-100 (CD4'*') peptide derived from the nef sequence was added for 
specific stimulation. In monkeys, the responses against pol were evaluated using two 
pools (L and R) of 20-aa peptides that encompass the entire pol sequence and overlap 

35 by 10 amino acids. In monkeys vaccinated with the nef constructs, a single pool 

containing 20-mer peptides covering the entire HIV-l nef sequence and overlapping 
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by 10 aa was used. Each sample/antigen mixture was perfonned in triplicate wells for 
murine samples or in duplicate wells for rhesus PBMCs. Plates were incubated at 
37°C, 5% CO2, 90% humidity for 20-24 h. The plates were washed with PBS/0.05% 
Tween 20 and incubated with 100 jiL/well of either 1.25 jxg/mL biotin-conjugated rat 
5 anti-mouse IFN-y mAb, clone XMG1.2 (Pharmingen) or of 0.1 ug/mL biotinylated 
anti-human IFN-gamma goat polyclonal antibody (R&D Systems) at 4°C overnight. 
The plates were washed and incubated with 100 pL/well 1/2500 dilution of 
strepavidin-alkaHne phosphatase conjugate (Pharmingen) in PBS/0.005% Tween/5% 
FBS for 30 min at 37 °C. Spots were developed by incubating with 100 \xUv/eU 1- 

10 step NBT/BCIP (Pierce Chemicals) for 6-10 min. The plates were washed with water 
and allowed to air dry. The number of spots in each well was determined using a 
dissecting microscope and the data normalized to 10^ cell input. 

Non-human Primate anti-RTEUSA - The pol-specific antibodies in the 
monkeys were measured in a competitive RT EIA assay, wherein sample activity is 

15 determined by the ability to block RT antigen from binding to coating antibody on the 
plate well. Briefly, Maxisorp plates were coated with saturating amounts of pol 
^sitive human serum (#9711 1234). 250 uL of each sample is incubated with 15 uL 
of 266 ng/mL RT recombinant protein (in RCM 563, 1% BSA, 0.1% tween, 0.1% 
NaNs) and 20 uL of lysis buffer (Coulter p24 antigen assay kit) for 15 min at room 

20 temperature. Similar mixtures are prepared using serially diluted samples of a 

standard and a negative control which defines maximum RT binding. 200 uUwell of 
each sample and standard were added to the washed plate and the plate incubated 16- 
24 h at room temperature. Bound RT is quantified following the procedures described 
in Coulter p24 assay kit and reported in milUMerck units per mL arbitrarily defined 

25 by the chosen standard. 

Results - Rodetvt Studies - B ALB/c mice (n=5 mice/cohort) were immunized 
once or twice with varying doses of MRKAd5hCMV-IApol(E3+) and 
MRKAd5hCMV-IApol(E3-). At 3 weeks after the second dose, Anti-pol IgG levels 
were determined by an ELJS A assay using RT as a surrogate antigen. Cellular 

30 response were quantified via IFNy ELISpot assay against pools of pol-epitope 

containing peptides. The results of these assays are summarizedin Table 10. The . . 
results indicate that the mouse vaccinees exhibited detectable anti-RT IgGs with an 
adenovector dose as low as 10^7 vp. The humoral responses are highly dose- 
dependent and are boostable with a second immunization. One or two doses of either 

35 pol vectors elicit high frequencies of antigen-specific CD4* and CDS* T cells; the 
responses are weakly dose-dependent but are boostable with a second immunization. 
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Table 10. Immunofienicity of MF 


lKAd5| 


dol Vectors in B ALB/c mice. 


Group 


Vaodne 


Dose 


No. of 
Doses 


Antl-RT laG Titers" 


SFCrt 0*6 cells' 


GMT 




-SE 


Medium 


CD4<i^ 
peptide 

DOOl 


CO&i- 
peptlde 
pool 


1 


MRKAdShCMVFLpol (E34^) 


10^7 VP 


2 
1 


310419 
919 


301785 
372 


163020 
266 


1(1) 
1(1) 


75(4) 
72(9) 


2313(67) 
533(41} 


2 


MRKAdShCMVFLpoJ {E3f) 


icy^vp 


2 
1 


1638400'' 
713155 


0 

628520 


0 

303555 


2(2) 
1(1) 


114(9) 
48(7) 


2063(182) 
733(89) 


3 


MRKA(15hCMVapol(E3-) 


10^7vp 


2 
1 


. 310419 

6400 


38621 B 
14013 


172097 
4393 


0(0) 
10(8) 


223(7) 
141(21) 


2607(27) 
409(26) 


4 


MRKAd5hCMVB.pol (E3-) 


i(y«vp 


2 
1 


1638400" 
1241675* 


0 

396725 


0 

300661 


1(1) 

m 


^60(13) 
38(13) 


2385(11) 
833(83) 


5 


Naive 


none 


none 


57 


9 


7 


9(2) 


11(4) 


10(1) 



*GMT, geometric mean titer of the cohort of 5 mice; SE, standard error of the gemetric mean 
''Near or at the upper Dmit of the serial dilution; henoe, could be greater than this value 

"No. of Spot-forming CeDs per mlllton splecnoytes; mean values of tripllcales are reported along with standard errors In parenthesis. 



C57/BL6 mice were iimnunized once or twice with varying doses of 
MRKAd5hCMV-nef(G2A,LLAA) (E3+), MRKAd5mCMV-nef(G2A,LLAA) (E3+) 
at either lO'^? vp and(3) MRKAd5mCMV-tpanef(LLAA) (E3+) at either 10^7 vp and 
10^9 vp. The inmiune response were analyzed using similar protocols and the results 

z listed in Table 11. While anti-nef IgG responses could not be detected in this 
model system with any of the constructs, there are strong indications of a cellular 
immunity generated against nef using the ELIspot assay. 



Table 11. Immunogenicity of M] 


RKAdSnef Vectors in C57/BL6 r 


nice. 


Group 


Vaccine 


Dose 


No. of 
Doses 


Antl-nef IgG Titers" 


S 


FC/1 0*6 cell 




GMT 


+SE 


-SE 


Medium 


aa51-70 
CD8> 


aa81-10a 
CD4^ 


1 


MRKAdShCMVFLnef (E3+) 


10^7vp 


2 
1 


174 
132 


70 
42 


60 
32 


1(1) 
0(0) 


23(1) 
0(0) 


1(1) 
0(0) 


2 . 


MHKAd6hCMVFLnef (E3+) 


10^ vp 


2 
1 


174 
132 


70 
42 


50 
32 


0(0) 

1(1) 


61(7) 
62(7) 


4(2) 
3(1) 


3 


MRKAdStnCMVanef (E^) 


10^7vp 


2 
1 


132 
115 


42 
46 


32 
33 


3(1) 
3(2) 


15(5) 
3(2) 


5(2) 
4(2) 


4 


MRKAdSmCMVFLnef (E3f) 


10A9vp 


2 
1 


132 
132 


42 
42 


32 
32 


4(2) 
2(1) 


83(13) 
29(2) 


5(1) 
4(0) 


5 


MRKAd5mCMVtpanef(E3+) 


10^7 vp 


2 
1 


132 
100 


42 
0 


32 
0 


3(2) 
3(1) 


14(2) 
13(4) 


5(1) 
10(3) 


6 


MRKAd6mCMVtpanef(E34^) 


10^ vp 


2 
1 


230 
115 


170 
46 


98 
33 


3(2) 
7(1) 


145(29) 
151(14) 


4(0) 
10(0) 


? 


Nai\/e 


none 


none 


152 


78 


62 • 


21(2) 


18(6) 


26(3) 



•GMT, geometric mean liter of the cohort of 5 mice; SE, standard error of the gemetric mean 
''No, of spot-forming oeBs per million splecnoytes; mean values of triplicates are reported along wllh standard errors h parenthesis. 



Monkey Studies - Cohorts of 3 rhesus macaques were inununized with 2 doses 
of MRKAd5hCMV-IApolCE3+) and MRKAd5hCMV-IApolCE3-). The number of 
antigen-specific T cells (per million PBMCs) were enumerated using one of two 
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peptide pools (L and R) that cover the entire pol sequence; the results are listed in 
Table 12. Moderate-to-strong T cell responses were detected in the vaccinees using 
either constructs even at a low dose of lO'^P vp. Longitudinal analyses of the anti-RT 
antibody titers in the animals suggest that the pol transgene product is expressed 
efficiently to elicit a humoral response (Table 13). It would appear that generally 
higher immune responses were observed in animals that received the E3- construct 
compared to the E3+ virus. 

Table 12. Pol-specific T Cell Responses in MRKAdSpol Immunized Rhesus 



Macaques. 



Vaccine (TbO,4 wks) 


Monk# 


Prebleed 


T=4 


T=7 


TalB 






Mock 


Pol L 


PoIR 


Mock 


PolL 


PoIR 


Mock 


Pol L 


PoIR 


Mock 


PolL 


PoIR 




99CT00 


1 


0 


0 


1 


3B 


31 


0 


52 


146 


0 


49 


715 


10*11 vp 


99C215 


1 


2 


2 


10 


96 


249 


1 


109 


305 


22 


88 


250 


99D201 


5 


5 


4 


6 


149 


95 


0 


40 


35 


0 


35 


18 


M7KAd5hCMy/-IApd(E3+) 


990212 


0 


2 


0 


4 


331 


114 


0 


58 


14 


0 


6 


6 


1(^9 VP 


99D180 


0 


4 


2 


0 


19 


192 


4 


36 


156 


5 


38 


106 




99C201 


8 


5 


21 


6 


62 


62 


0 


18 


32 


1 


14 


65 


M^KAdShCMV-IApdC^d-} 


99D239 


5 


2 


2 


20 


82 


172 


1 


66 


114 


9 


21 


40 


io*n vp 


99C186 


4 


12 


6 


5 


120 


421 


2 


271 


489 


16 


875 


530 




99C084 


1 


8 


9 


8 


84 


464 


0 


14 


236 


1 


24 


264 


MRKAd5hCMV^ApdCE3-) 


OC7C 


10 


10 


8 


12 


724 


745 


4 


322 


378 


4 


IBS 


176 


l(K9vp 


CD1G 


2 


0 


1 


5 


474 


468 


0 


232 


212 


0 


101 


121 




CDIl 


6 


6 


12 


10 


98 


110 


5 


60 


80 


8 


25 


34 


Ito-VB 


OBSQ 


nd 


nd 


nd 


nd 


nd 


nd 


4 


2 


2 




1 


2 



nd, not datoffnlnfld 
RaportBd are SFC per mOllon PGMCs; mean ol dttpDcato wens. 



Table 13. Anti-RT Ig Levels in MRKAdSpol Immunized macaques. 



RT ANTIBODY ASSAY TITERS INmMUyhlL 








Vccdn^onkey T ag 


T=4 


T=7 


T=12 


1=16 


MRKAd5hCMV-IAc>ol(E3-f), vo 










99C100 


61 


1999 


6928 


4768 


99C215 


81 


1641 


2356 


2767 


99D201 


53 


336 


639 


387 












MRKAd5hCMV-IADOl(E3+). 10''9vD 










99D212 


10 


40 


49 


68 


99D180 


<10 


36 


79 


93 


99C201 


<10 


37 


71 


76 












MRKAdShCMV-IADoKES-). lO^'ll VD 










99D239 


44 


460 


1234 


1015 


99C186 


21 


233 


480 


345 


99C084 


235 


2637 


2858 


1626 












MRKAd5hCMV-IADOl(E3-). }0'^vo 










CC7C 


32 


175 


306 


235 


CDIG 


20 


140 


273 


419 


CDll 


16 


112 


149 


237 
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When rhesus macaques were immumzed i,m. with two doses of MRKAdSnef 
5 constructs, vigorous T cell responses ranging from 100 to as high as 1100 per million 
were observed in 8 of 12 vaccinees (Table 14). The efficacies of the mCMV- and 
hCMV- driven nef constructs are comparable on the basis of the data generated thus 
far. 

10 Table 14. Nef-specific T cell Responses in MRKAdSnef hnmunized Rhesus 



Macaques, 



Vaccine (T=0,4 wks) 


Monk# 


Pre 


T=4 


T=7 


T= 


16 






Mock 


Nef 


Mock 


Nef 


Mock 


Nef 


Mock 


Nef 


MBKAd6hCMV-nef(Q2A,LLAA) (E3+) 


CD2D 


0 


4 


31 


440 


4 


368 


1 


251 


10^11 vp 


CC7B 


0 


0 


2 


521 


D 


178 


1 


1522 


CC61 


2 


9 


31 


112 


0 


108 


11 


100 


MRKAdShCMV-nefCGaMLAA) (E3+) 


CC2K 


9 


9 


6 


52 


0 


35 


0 


15 


Ityvg vp 


CD15 


5 


4 


30 


998 


• 2 


586 


0 


434 


CD16 


6 


1 


6 


1146 


0 


369 


1 


212 


MRKAd6mCMV-nef(G2A,LLAA) (E3f) . 


99D191 


1 


5 


4 


614 


0 


298 


2 


419 


10^11 vp 


99D144 


4 


6 


5 


434 


0 


1100 


2 


932 


99C193 


1 


2 


1 


58 


1 


22 


0 


64 


MRKAd5mCMV-nef(Q2A.LJLAA) {E3-I-) 


99D224 


1 


11 


14 


231 


1 


125 


0 


70 


1(y*9vp 


990250 


8 


9 


4 


108 


0 


54 


0 


6 


9901 20 


1 


6 


20 


299 


0 


92 


0 


79 


NaTve 


083Q 


nd 


nd 


18 


22 


4 


5 


2 


1 



EXAMPLE 25 

15 Comparison of Clade B vs. Clade C T Cell Responses in HIV-infected Subjects 
PBMC samples collected from two dozens of patients infected with HIV-1 in 
US were tested in EUSPOT assays with peptide pools of 20-mer peptides overlapping 
by 10 amino acids. Four different peptide pools were tested for cross-clade 
recognition, and they were either derived from a clade B-based isolate (gag H-b; nef- 

20 b) or a clade C-based isolate (gag H-c, nef-c). Data in Table 15 shows that T cells 
from these patients presumably infected with clade B HIV-1 could recognize clade C 
gag and nef antigens in ELISPOT assay. Correlation analysis further demonstrated 
that these T cell responses against clade C gag peptide pool were about 60% of the 
clade B counterpart (Figure 24), while the T cell responses against clade C nef were ^ 

25 about 85% of the clade B counterpart (Figure 25). These results suggest that cellular 
immune responses generated in patients infected with clade B HIV-1 can recognize 
gag and nef antigens derived from clade C HTV-l. These data show that a HIV 
vaccine, such as a DNA or MRKAd5-based adenoviral vaccine expressing a clade B 
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gag and/or nef antigen will potentially have the ability to provide a prophylactic 
and/or therapetic advantage on a global scale. 



5 Table 15 

Responses Shown as the Number of gIFN-Secreting T Cells per Million 
PBMCs 



subject 


bleed date 


gag epitope # | mock 


gag H-b 


gagH-c 


nef-b 


nef-c 






from mapping) 










#100 


19-JUI-99 


12 


10 


3950 


1385 


1295 


1300 


#101 


26-JUI-99 


3 


15 


3885 


1280 


na 


1020 


#102 


25-JUI-99 


4 


15 


1740 


850 


1255 


1785 


#104 


7-Jun-99 


2 


5 


1355 


1185 


na 


1060 


#107 


n-ocr-99 


2 


25 


3305 


2795 


670 


870 


#405 


ll-Jul-99 


2 


15 


4575 


3180 


1700 


1500 


#501 


19-JUI-99 


2 


15 


1100 


570 


3365 


3460 


#505 


I8-JUI-99 


5 


10 


2145 


1725 


1235 


na 


#506 


28-Feb-99 


2 


25 


150 


45 


400 


610 


#701 


2e-Mar-99 


5 


30 


7620 


4775 


3320 


2780 


#709 


17-May-99 


3 


15 


2785 


1945 


1090 


1630 


#710 


24-May-99 


4 


5 


1055 


1080 


2210 


2140 



















10 EXAMPLE 26 

Characterization and Production of MRKAdSpol and MRKAdSnef 
Vectors in Roller Bottles 

Expansion ofnefojidpolAdenovectors - Nef and pol CsCl purified MRKAdS 
15 seeds were used to infect roller bottles to produce P4 virus to be used as a seed for 
further experiments. P4 MRKAd5 pol and nef vectors were used to infect roller 
bottles at an MOI 280 vp/cell, except for hCMV-tpa-nef [£3+] which was infected at 
an MOI of 125 due to low titers of seed obtained at P4. 

20 Table 16 Viral particle concentrations for P5 nef and pol adenovectors 



Adenovector 


AEX Titer 
(10'° vp/ml culture) 


AEX Titer 
(10" vp/cell) 


Amplification 
Ratio 


hCMV-FL-nef [E3+] 


1.1 


0.9 


30 


mCMV-FL-nef [E3+] 


2.2 


2.1 


75 


hCMV-tpa-nef [E3+] 


0.07 


0.1 


5 


mCMV-tpa-nef IE3+] 


1.3 


0.9 


35 


hCMV-FL-pol [E3+] 


2.7 


2.1 


75 


hCMV-FL-pol [E3-] 


1.9 


1.3 


45 
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5 Roller Bottle Passaging - Passaging of the pol and /ze/constructs continued 

through passage seven. Cell-associated (freeze/thaw lysis) and whole broth (triton- 
lysis) titers obtained in all passages were very consistent. In general, MRKAd5pol is 
ca, 70% as productive as MRKAdSgag while MKKAdSnef is ca, 25% as productive 
as MRKAdSgag. Samples of P7 virus for both constructs were analyzed by V&CB by 

10 restriction digest analysis and did not show any rearrangements. 



Table 17. Passage Six Viral Productivity for MRKAdSpol and MRKAd5ncf 





Xvi&ble (U 
Viabili 
InfectlOD 


)" cells/ml), 
ty(%) 
Harvest 


Cell Passage 
Number 


AEX Titer 
(Cell Associated) 
10" vp/ml culture 


•nter 
ID* vp/ccll 


AmpIificstiOQ 
Ratio 


Tfiton Lysis 'liter 
10'* vp^ml culture 


hCMV-FI^nef [B3+] pool 


1.22,859b 




62 


0.8 


0.7 


25 


1.6 


1 

2 




0.99.62% 
1.10.72% 












hCMV-FI^pol [B3+1 pool 


1.42, 89% 




62 


4.5 


3.2 


115 


7.0 


I 
2 




1.22, 70% 
1.42.74% 












15 Table 18. 


Passage Seven Viral Productivity for MRKAdSpoI and 


MRKAdSnef 


Xviab]e(lC 
Viabil 
Infection 


1*^ ceUs/ml), 

ty(%) 

Harvest 


Cell Passage 
Number 


AEX Titer 
(Cell Associated) 
lO^vp^ culture 


Titer 
10*viVccU 


Amplification 
Ratio 


Triton Lysis Titer 
lO^'^vp/tolcultuie 


hCMV-FL-nei[B3+-] Pool 


133,90% 




66 


1.0 


0.8 


29 


2.1 


1 

2 




0.96.70% 
1.18.73% . 












hCMV-FL-pol [E3+1 Pool 


0.90*. 90% 




56 


42 


4.7 


168 


6.5 


1 

2 




1.18. 88% 
1.04.80% 













MRKAdSnef and MRKAdSpoI Viral Production Kinetics - A timecourse 
experiment was carried out in roller bottles to determine if the viral production 
kinetics of the MRKAd5pol and MRKAdSnef vectors were similar to those of 

20 MRKAd5gag. PER.C6® cells in roller bottle cultures were infected at an MOI of 280 
vp/cells with P5 MRKAd5pol, P5 MRKAd5nef and P7 MRKAdSgag; for each 
adeno vector, two infected bottles were sampled at 24, 36, 48, and 60 hours post 
infection. In addition, two bottles were left unsamplcd until 48 hpi when they were 
harvested under the Phase I process conditions. The anion-exchange HPLC viral 

25 particle concentrations of the freeze-thaw recovered cell associated virus at the 24, 36, 
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48, and 60 hpi timepoints are shown in Figure 29A-B. The QPA titers show a similar 
trend (data not shown). 

Comparison of hCMV- md mCMV-FL-nef - As the titers obtained with the 
5 MRKAdSnef construct (hCMV-FL-nef) were lower than those obtained with 
MRKAdSgag or MRXAd5pol, a viral productivity comparison experiment was 
performed with mCMV-EL-nef . For each of the two adcnovcctors (hCMV- and 
mCMV-FL-neO, two roller bottles were infected at an MOI of 280 vp/cell with 
passage five clarified lysate. The macroscopic and microscopic observations of the 
10 four roller bottles were identical at the time of harvest. Analysis of the clarified lysate 
produced indicated a higher viral particle concentration in the bottles infected with 
mCMV-FL-ncf» as shown in Tabic 19. It is stipulated that the higher productivity with 
mCMV promoter driven nef vector is due to lower nef expression levels in PER.C6® 
cells- experiments are underway at V&CB to measure nef expression levels. 

15 

Table 19. Passage Six Viral Productivity Comparison of hCMV- and mCMV-FL-nef 







Xv (10* cells/ml), ViahiUty (%) 


Cell Passage 


AEX Titer 


Tiler 


AmpUfication 


Triton Lysis Titer 






bifecdoD 


Hoivest 


Number 


10" vpAnl culture 


10*vp/cel] 


Ratio 


10*° vp/knl culture 


hCMV-FUaef 


Pool 


1.11,91* 




60 


1.5 


1.4 


SO 


ZS 


(MRKAdSnef) 


1 

2 




123.75% 
134.74% 












mCMV-FUncf 


Pool 


1.11,91% 




60 


2.3 


2.1 


75 


4.6 




] 




149, 84% 














2 




1.18,77* 













20 EXAMPLE 27 

Characterization and Large Scale Production of 
MRKAdSnef Virus in Bioreactors 
Materials and Methods - The experiment of the present example was run twice 
under the following conditions: 36.5°C, DO 30%, pH 7.30, ISOrpm agitation rate, 
25 no sparging, life Technologies (Gibco. Invitrogen) 293 SFM U (with 6mM L- 

glutamine), 0.5M NaOH as base for pH control. During the first run (B200101 15), 
two lOL stirred vessel bioreactors were inoculated with PER.C6® cells at a 
concentration of 0.2x10^ cells/ml. Cells were grown until they reached a cell 
concentration of approximately 1x10^ cells/ml. The cells were infected with uncloned 
30 MRKAdSnef (G2A.LLAA) at a MOI of 280 virus particles (vp)/ccll. For the second 
batch (B200 10202), the same procedure as the first run was used, except the cells 
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were infected with cloned MRAdSnef . During both runs, the bioreactors were 
harvested 48 hours post-infection. Samples were taken and virus concentrations were 
determined from whole broth (with triton lysis), supernatant, and cell pellets (3 X 
freeze/thaw) with the AEX and QPA assays. Metabolites were measured with 
5 BioProfile 250 throughout the process. 



Table 20: Experimenta] 


I Conditions 


Temperature 


36.5 '^C 


DO 


30% 


PH 


7.30 


A^tation 


150 rpm 


Spatfiing 


None 


Table 21: Virus source used for experiments. 



10 



Run 


Batch ID 


Cloned/Uncloned 


MOI 






MRKAd5nef 


(vp/ccDs) 


#1 


B20010115-1 


Uncloned 


280 




B200I0115-2 


Uncloned 


280 


#2 


B20010202-1 


Cloned 


280 




B20010202-2 


Cloned 


280 



Results - Table 22 and 23 show an the ability to scale up production of 
MRKAdSnef by growth in a bioreactor. 



15 



Table 22: Virus Concentration as measured by the AEX assay 



Run 


Batch ID 


Cloned/Uncloned 
MRKAd5nef 


Virus Concentration @ 48hpi (1x10*^ vp/L) 


Supernatant 


Clarified Lysate 


Total 


Triton Lysate 




B200101I5-1 


Uncloned 


0.72 


3.26 


3.98 


5.76 


B20010115-2 


Uncloned 


0.38 


1.67 


2.05 


2,46 


#2 


B20010202-1 


Cloned 


0.80 


6.00 


6.80 


8.88 


B20010202-2 


Cloned 


0.50 


6.00 


6.50 


8.47 



Table 23: Virus Titers as measured by the QPA assay 



Run 


Batch ID 


QonedAJncloned 
MRKAdSnef 


Virus Concentration ® 48hpi (IxlO" lU/L) 


Whole 
Broth 


Supernatant 


Qarified 
Lysate 


Total 


Triton 
Lysate 


#1 


B200101154 


Uncloned 


0.13 


1.12 


1.76 


2.88 


11.28 


B20010115-2 


Uncloned 


0.14 


0.73 


1.54 


2.27 


5.86 


#2 


B20010202-1 


Cloned 


0.14 


0.97 


1.62 


2.69 


11.89 


B20010202-2 


Cloned 


0.14 


1.17 


1.70 


2.97 


12.47 



20 



The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in 
addition to those described herein will become apparent to those skilled in the art 
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from the foregoing description. Such modifications are intended to fall within the 
scope of the appended claims. 

EXAMPLE 28 

5 MRKAdSHIV-lgag Boosting of DNA-Primed Animals 

Groups of 3-5 rhesus macaques were immunized with (a) 5 mgs of VlJns- 
Flgag (pVIJnsCMV(no intron>FL-gag-bGHpA), (b) 5 mgs of VlJns-Flgag 
formulated with 45 mgs of a non-ionic block copolymer CRL1005, or (c) 5 mgs of 

10 VI Jns-Hgag formulated with 7.5 mgs of CRL1005 and 0.6 mM benzalkonium 
chloride at weeks 0, 4, and 8. All animals received a single dose of 10e7 viral 
particles (vp) of the MRKAd5HIV-lgag at week 26, Note: lOe? is too low to prime 
or boost effectively when used as a single modality (dose is selected to mimic 
preexposure to adenovirus); see Figure 32. 

15 Blood samples were collected from all animals at several time points and 

peripheral blood mononuclear cells (PBMCs) were prepared using standard FicoU 
method. The PBMCs were counted and analyzed for gamma-interferon secretion 
using the ELISpot assay (Table 24). For each monkey, the PBMCs were incubated 
overnight either in the absence (medium) or presence of a pool (called "gag H") of 50 

20 20-aa long peptides that encompass the entire HTV-l gag sequence. 

The results indicate that MRKAd5HIV-lgag was very effective in boosting 
the T cell immune responses in these monkeys. At week 28 or 2 weeks after the viral 
boost, the number of gag-specific T cells per million PBMCs increased 2-48 fold 
compared to the levels observed at week 24 or 2 weeks prior to the boost. 

25 The PBMCs were also analyzed by intracellular gamma-interferon staining 

prior to (at week 10) and after the MRKAd5gag boost (at week 30). The results for 
select animals are shown on Figure 31. The results indicate that (a) immunization 
with DNA/adjuvant formulation elicited T cell responses which can either be 
balanced, CD4"*"-biased or CD8'*"-biased, and (b) boosting with the MRKAdSgag 

30 construct produced in all cases a strongly CDS'^-biased response. These results 

suggest that boosting with MRKAdSHIV-lgag construct is able to improve the levels 
of antigen-specific CDS"*" T cells. 
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EXAMPLE 29 

Construction of gagpol fusion for MRKAdSgagpol fusion constructs 

The open reading frames for the codon-optimized HTV-l gag gene was fused 
5 directly to the open reading frame of the lA pol gene (consisting of RT, RNAseH and 
integrase domains) by stepwise PGR. Because the gene (SEQ ID NO: 38) does not 
include the protease gene and the frameshift sequence, it encodes a single polypeptide 
of the combined size of p55, RT, RNAse H and integrase (1350 amino acids; SEQ ID 
NO: 39). 

10 The fragment that extends from the BstEII site within the gag gene to the last 

non-stop codon was ligated via PGR to a fragment that extends from the start codon 
of the lApol to a unique BamH/ site. This fragment was digested with BstEII and 
BamHi". Gonstruction of gag-IApol fusion was achieved via three-fragment ligation 
involving the PstZ-BstE// gag digestion fragment, the BstE/7/BamH/ digested PGR 

15 product and long Pst//BaraH/ VlR-FLpol backbone fragment. 

The MRKAdS-gagpol adenovirus vector was constructed using the Bgl/7 
fragment of the VlR-gagpol containing the entire ORF of gag-IApol fusion gene. 

EXAMPLE 30 

20 Lnmunogenicity Studies in Non-Human Primates 

Gohorts of three (3) macaques were immunized with lOeS or lOelO viral 
particles (vp) of one of the following MRKAd5 HTV-l vaccines: (1) MRKAdSgag; 
(2) MRKAdSpol; (3) MRKAd5nef; (4) a mixture containing equal amounts of 
25 MRKAdSgag, MRKAdSpol, and MRKAdSnef, or (5) a mixture of equal amounts of 
MRKAdSgagpol and MRKAdSnef. The vaccines were administered at weeks 0 and 
4. 

The T cell responses against each of the HTV-l antigens were assayed by IFN- 
ganmia EUSpot assay using pools of 20-aa peptides that encompass the entire protein 

30 sequence of each antigen. The results (Table 25) are expressed as the number of spot- 
forming cells (sfc) per million peripheral blood mononuclear cells (PBMG) that 
respond to each of the peptide pools. 

Results indicate the following observations: (1) each of the single gene 
constructs (MRKAdSgag, MRKAdSpol, or MRKAdSnef) is able to elicit high levels 

35 of antigen-specific T cells in monkeys; (2) the single-gene MRKAdS constructs can 
be mixed as a multi-cocktail formulation capable of eliciting very broad T cell 
responses against gag, pol, and nef; (3) the MRKAdS vector expressing the fusion 
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protein of gag plus lA pel is capable of inducing strong T cell responses to both gag 
and pol. 



Table 25. Evaluation of Mixtures of MRKAdS vectors expressing humanized 
HIV-1 gag, pol, gagpol, nef in rhesus macaques 



Grp# 



Vaccine 
T=0, 4 wks 



Monk# 



T=6 wks 



Mock 



GagH 



Pol-1 



Pol -2 



MRKAdS gag 
10^10 vp 



CB9V 
CD19 
109H 



15 
374 
843 



MRKAdS gag 
10^ vp 



99D130 
W277 
143H 



1 

16 
4 



948 
324 
595 



MRKAdS pol 
lO^IOvp 



CC1X 
AW3W 
AV43 



46 
463 
95 



256 
550 
1333 



MRKAdS pol 
10^8 vp 



AW38 
CC8K 
CC21 



19 
50 
33 



30 
995 
436 



MRKAdS nef 
10^10 vp 



076Q 
091 Q 
083Q 



MRKAdS nef 
10A8VP 



00C029 
980022 
980160 



1 
6 
3 



MRKAd5gag+MRKAd5pol+MRKAd5nef 
10^10 vp each 



99D251 

05H 
00C016 



206 
135 
26 



15 
21 
4 



MRKAd5gag+MRKAd5pol+MRKAd5nef 
10^ vp each 



990215 
81 H 
12H 



1 
5 
8 



171 
73 
1140 



18 
6 

115 



193 
9 
51 



193 
14 
811 



MRKAdSgagpol +MRKAd5 nef 
1 0^1 Ovp each 



99D211 
22H 
61 H 



0 
4 
4 



63 
385 
343 



56 
119 
11 



838 
1194 
765 



10 



MRKAdSgagpol +MRKAd5 nef 
10^ vp each 



34H 
48H 
70H 



78 
65 
158 



19 
105 
15 



5 

46 
220 



Indicated are numbers of spot-fonning cells per million PBMCS against the peptide pools. Mock, no peptides; gag 
H, fifty 20-aa peptides encompassing p55 sequence; pol-1, 20-aa peptides representing N-tenninal half of lA pol; 
pol-2, 20-aa peptides representing the carboxy-terminal half of lA pol; nef, 20-aa peptides encompassing the entire 
wild-type nef sequence. Responses to the antigens prior to the first immunization did not exceed 40 sfc/lC^ 
PBMC. 
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WHAT IS CLAIMED IS 

1. A recombinant adenoviral vaccine vector at least partially deleted in 
5 El and devoid of El activity, comprising: 

a) an adenovirus c/.s-acting packaging region corresponding to from 
about base pair 1 to between from about base pair 400 to about 
base pair 458 of a wildtype adenovirus genome; and 

b) a gene encoding an HIV protein or inamunologically relevant 
10 modification thereof. 

2. A vector in accordance with claim 1 comprising a packaging region 
corresponding to from about base pair 1 to about base pair 450 of a wildtype 
^^novirus genome. 

3. A vector in accordance with claim 1 further comprising nucleotides 
15 corresponding to between from about base pair 351 1 to about 3524 to about base pair 

5798 of a wildtype adenovirus genome. 

4. A vector in accordance with claim 3 comprising base pairs 
corresponding to 1-450 and 3511-5798 of a wildtype adenovirus genome. 

5. A vector in accordance with claim 4 which is deleted of base pairs 

20 451-3510. 

6. A vector in accordance with claim 1 which is at least partially 

deleted in E3. 

7. A vector in accordance with claim 6 wherein the E3 deleted region 
is from base pairs 28,133-30,818. 
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8. A vector in accordance with claim 1 wherein the gene encoding the 
HIV protein or modification thereof comprises codons optimized for expression in a 
human. 

9. A vector in accordance with claim 1 wherein the vector comprises a 
gene expression cassette comprising: 

a) a nucleic acid encoding a protein; 

b) a heterologous promoter operatively linked to the nucleic acid 
encoding the protein; and 

(c) a transcription termination sequence. 

10. A vector in accordance with claim 9 wherein the gene expression 
cassette is inserted into the El region. 

1 1. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette is in an El parallel orientation 

12. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette is in an El antiparallel orientation. 

13. An adenoviral vector in accordance with claim 9 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 

14. An adenoviral vector in accordance with claim 13 wherein the 
promoter is an immediate early human cytomegalovirus promoter. 

15. An adenoviral vector in accordance with claim 9 wherein the 
promoter is a murine cytomegalovirus piromoter, 

16. An adenoviral vector in accordance with claim 9 wherein the 
transcription termination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 
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17. An adenoviral vector in accordance with claim 9 wherein the 
transcription termination sequence is a synthetic polyadenylation signal (SPA). 

18. A cell comprising the adraoviral vector of claim 1. 

19. Recombinant, replication-defective adenovirus particles harvested 
5 and purified subsequent to transfection of the adenoviral vector of claim 1 into a cell 

line which expresses adenovirus El protein at complementing levels. 

20. An HTV vaccine composition comprising purified adenovirus 
particles of claim 19. 

21. An HIV vaccine composition of claim 20 which comprises a 
10 physiologically acceptable carrier. 

22. A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 
claim 1 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, and harvesting the resultant recombinant, 

15 replication-defective adenovirus. 

23. A method according to claim 22 wherein the cell is a PER.C6® 

cell. 

24. A method of generating a cellular-mediated inmiune response 
against HIV in an individual comprising administering to the individual a vaccine of 

20 claim 21. 

25. A method according to claim 24 which further comprises 
administration to the individual a DNA plasmid vaccine, optionally adnainistered with 
a biologically effective adjuvant, protein or other agent capable of increasing the 
immune response. 
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26. A method according to claim 25 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovirus 
vaccine. 

27. A method according to claim 24 wherein the adenovirus vaccine is 
preceded by an adenovims vaccine of a different serotype. 

28. A method according to claim 24 which comprises administering 
and readministering the adenovirus vaccine vector to the individual. 

29. An adenoviral vector in accordance with claim 1 wherein the HIV 
protein is HIV gag or an immunologically relevant modification thereof. 

30. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette comprises an open reading frame encoding an HIV gag protein or 
immunologically relevant modification thereof. 

3 1. A recombinant adenoviral vaccine vector at least partially deleted 
in EI and devoid of El activity, comprising: 

a) an adenovirus cf.y-acting packaging region corresponding to from 
about base pair 1 to about base pair 4S0 and from about 351 1 to about 
5798 of a wildtype adenovirus genome, and deleted for base pairs 
corresponding to from about base pair 451 to from about base pair 
3510 of a wildtype adenovuns genome; and 

b) a gene expression cassette comprising 

i) SEQIDNO:29; 

ii) a heterologous promoter operatively linked to i); and 

iii) a transcription termination sequence, 
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32. An adenoviral vector in accordance with claim 31 wherein the 
gene expression cassette is in an El parallel orientation. 

33 An adenoviral vector in accordance with claim 31 wherein the gene 
expression cassette is in an El antiparallel orientation. 

34. An adenoviral vector in accordance with claim 31 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 

35. An adenoviral vector in accordance with claim 31 wherein the 
transcription termination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 

36. An adenoviral vector in accordance with claim 31 which is at least 
partially deleted in E3. 

37. A cell comprising the adenoviral vector of claim 30. 

38. Recombinant, replication-defective adenovirus particles harvested 
and purified subsequent to transfection of the adenoviral vector of claim 30 into a cell 
line which expresses adenovirus El protein at complementing levels. 

39. An HIV vaccine composition comprising purified adenovirus 
particles of claim 38. 

40. An HTV vaccine composition of claim 39 which comprises a 
physiologically acceptable carrier. 

41. A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 
claim 30 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, and harvesting the resultant recombinant, 
replication-defective adenovirus. 
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42. A method according to claim 41 wherein the cell is a PER.C6® 

cell. 

43. A method of generating a cellular-mediated immune response 
against HIV in an individual comprising administering to the individual a vaccine of 

5 claim 21. 

44. A method according to claim 43 which further comprises 
administration to the individual a DNA plasmid vaccine, optionally administered with 
a biologically effective adjuvant, protein or other agent capable of increasing the 
inmiune response. 

10 45. A method according to claim 44 wherein the DNA plasmid 

vaccine is administered to the individual prior to administration of an adenovirus 
vaccine. 

46. A method according to claim 43 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 
15 47. A method according to claim 43 which comprises administering 

and readministering the adenovirus vaccine vector to the individual. 

48. An adenoviral vector in accordance with claim 1 wherein the HTV 
protein is HTV pol or an immunologically relevant modification thereof. 

49. An adenoviral vector in accordance with claim 9 wherein the gene 
20 expression cassette comprises an open reading frame encoding an HTV pol protein or 

inununologically relevant modification thereof. 

50. A recombinant adenoviral vaccine vector at least partially deleted 
in El and devoid of El activity, comprising: 
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a) an adenovirus m-acting packaging region conesponding to from 
about base pair 1 to about base pair 450 and from about 351 1 to about 
5798 of a wildtype adenovirus genome, and deleted for base pairs 
corresponding to from about base pair 451 to from about base pair 

5 3510 of a wildtype adenovirus genome; and 

b) a gene expression cassette comprising 

i) a nucleotide sequence selected the group consisting of 
SEQ ID NO: 1, SEQ ID NO; 5 and SEQ ID NO: 7; 

ii) a heterologous promoter operatively linked to i); and 
10 iii) a transcription termination sequence. 

51. An adenoviral vector in accordance with claim 50 wherein the 
gene expression cassette is in an El parallel orientation. 

52. An adenoviral vector in accordance with claim 50 wherein the 
gene expression cassette is in an El antiparallel orientation. 

15 53. An adenoviral vector in accordance with claim 50 wherein the 

promoter is a cytomegalovirus promoter devoid of inti'onic sequences. 

54. An adenoviral vector in accordance with claim 50 wherein the 
transcription termination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 
20 55. An adenoviral vector in accordance with claim 50 which is at least 

partially deleted in E3 . 

56. A cell comprising the adenoviral vector of claim 49. 
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57. Recombinant, replication-defective adenovirus particles harvested 
and purified subsequent to transfection of the adenoviral vector of claim 49 into a cell 
line which expresses adenovirus El protein at complementing levels. 

58. An HTV vaccine composition comprising purified adenovirus 
5 particles of claim 57. 

59. An HIV vaccine composition of claim 58 which comprises a 
physiologically acceptable carrier. 

60. A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 

10 claim 49 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, and harvesting the resultant recombinant, 
replication-defective adenovirus. 

61. A method according to claim 60 wherein the cell is a PER.C6® 

cell. 

15 62. A method of generating a cellular-mediated immune response 

against HTV in an individual comprising administering to the individual a vaccine of 
claim 59. 

63. A method according to claim 62 which further comprises 
administration to the individual a DNA plasmid vaccine, optionally adnoinistered with 

20 a biologically effective adjuvant, protein or other agent capable of increasing the 
immune response. 

64. A method according to claim 63 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovirus 
vaccine. 
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65. A method according to claim 62 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 

66. A method according to claim 62 which comprises administering 
and readministering the adenovirus vaccine vector to the individual. 

67. An adenoviral vector in accordance with claim 1 wherein the HIV 
protein is HIV nef or an immunologically relevant modification thereof. 

68. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette comprises an open reading frame encoding an HTV nef protein or 
immunologically relevant modification thereof. 

69. A recombinant adenoviral vaccine vector at least partially deleted 
in El and devoid of El activity, comprising: 

a) an adenovirus cw-acting packaging region corresponding to from 
about base pair 1 to about base pair 450 and from about 3511 to about 
5798 of a wildtype adenovirus genome, and deleted for base pairs 
corresponding to from about base pair 451 to from about base pair 
3510 of a wildtype adenovirus genome; and 

b) a gene expression cassette comprising 

i) a nucleotide sequence selected the group consisting of 
SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13 and 
SEQIDNO: 15; 

ii) a heterologous promoter operatively linked to i); and 

iii) a transcription termination sequence. 

70. An adenoviral vector in accordance with claim 69 wherein the 
gene expression cassette is in an El parallel orientation. 
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71 . An adenoviral vector in accordance with claim 69 wherein the 
gene expression cassette is in an El antiparallel orientation. 

72. An adenoviral vector in accordance with claim 69 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 

5 73. An adenoviral vector in accordance with claim 69 wherein the 

transcription termination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 

74. An adenoviral vector in accordance with claim 69 which is at least 
partially deleted in E3. 
10 75, A cell comprising the adenoviral vector of claim 68. 

76. Recombinant, replication-defective adenovims particles harvested 
and purified subsequent to transfection of the adenoviral vector of claim 68 into a cell 
line which expresses adenovims El protein at complementing levels. 

77. An HTV vaccine composition comprising purified adenovims 
15 particles of claim 76. 

78. An HTV vaccine composition of claim 77 which comprises a 
physiologically acceptable carrier. 

79. A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 

20 claim 68 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, arid harvesting the resultant recombinant, 
replication-defective adenovirus. 

80. A method according to claim 79 wherein the cell is a PER.C6® 

cell. 
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81. A method of generating a cellular-mediated immune response 
against HTV in an individual comprising administering to the individual a vaccine of 
claim 78. 

82. A method according to claim 81 which farther comprises 

5 administration to the individual a DNA plasmid vaccine, optionally administered with 
a biologically effective adjuvant, protein or other agent capable of increasing the 
iromune response. 

83. A method according to claim 82 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovirus 

10 vaccine. 

84- A method according to claim 81 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 

85. A method according to claim 81 which comprises administering 
and readministering the adenoviras vaccine vector to the individual. 
15 86. A multivalent adenovims vaccine composition comprising 

recombinant, replication-defective adenovirus particles, wherein the adenovirus 
particles are harvested and purified from a cell line expressing adenovirus El protein, 
and wherein the particles are harvested subsequent to transfection of the cells with an 
adenoviral vector or vectors in accordance with claim 9; said vector(s) comprising a 
20 gene expression cassette or cassettes comprising nucleotide sequences encoding HTV 
proteins selected from the group consisting of: 

a) gag, pol, and nef, expressed independently from three individual 
vectors; 
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b) gag, pol, and nef , expressed independently from one vector with 
the encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 

c) gag, pol, and nef, expressed via two vectors, one expressing a pol- 
5 nef fusion, and another expressing gag; 

d) gag, pol, and nef, expressed via two vectors, one expressing a gag- 
pol fusion and another expressing nef; 

e) gag, pol and nef, expressed via two vectors, one expressing a nef- 
gag fusion and another expressing pol; 

10 f) gag, pol, and nef, expressed via one vector expressing a gag-pol- 

nef fusion; 

g) gag and pol, expressed independently from two individual vectors; 

h) gag and pol, expressed independently from one vector with the 
encoding nucleic acid sequences operatively linked to distinct 

15 promoters and transcription termination sequences; 

i) pol and nef, expressed independently from two individual vectors; 
j) pol and nef, expressed independently from one vector with the 

encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 
20 k) nef and gag, expressed independently from two individual vectors; 

1) nef and gag, expressed independently from one vector with the 
encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 

m) gag and pol, expressed via one vector expressing a gag-pol fusion; 

no 
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n) pol and nef , expressed via one vector expressing a pol-nef fusion; 
and 

o) nef and gag, expressed via one vector expressing a nef-gag fusion. 

87. A multivalent adenovirus vaccine composition in accordance with 
5 claim 86 wherein the gag-pol fusion consists of SEQ ID NO: 39, 

88. A multivalent adenovirus vaccine composition in accordance with 
claim 86 wherein the fused sequences have the encoding nucleic acid sequences 
operatively linked to distinct promoters and transcription termination sequences. 

89. A multivalent adenovirus vaccine composition in accordance with 
10 claim 86 wherein the fused sequences have the encoding nucleic acid sequences 

operatively linked to a single promoter, and the encoding nucleic acid sequences 
operatively linked by an internal ribosome entry sequence ("IRES")- 
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Original Adenovector Construct: 




Ev 

Figure 1: Original HIV-l gag adenovector. 
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Sequence of the open reading frame for FLKaao (human codon optimized) 

algggtgctagggcttctgtgctgtctggtggtgagctggacaagtgggagaagalcaggctgaggcctgglgg 

caagaagaagtacaagctaaagcacattglgtgggcctccagggagclggagaggtttgclgtgaaccctggc 

cigctggagacctctgagggg1gcaggcagatcctgggcx:agctccagccctccctgqaaacaggctctgagg 

agctgaggtooctgtacaacacaglggctaccctgtactgtgtgcaccagaagattgatgtgaaggacaccaag 

gaggccctggagaagattgaggaggagcagaacaagtccaagaagaaggcxicagcaggDtgctgclggc 

acaggc:aactcc3gocaggtgtcx:cagaactac^cxat1glgcagadcctcx:agggccagatggtgcaocag 

gccatctaiccccggacxctgaatgcclggglgaagglggtggaggagaaggccttrtcccctgaggtgatcc^ 

catgtlrtctgccctglctgagggtgccaaxccMggacrtgaacaccatgclgaacacagtggggggccatc 

aggclgccatgcagatgctgaaggagaccatcaatgaggaggctgctgagtgggacaggotgcatcctgtgc 

acgctggccccattgcccccggccagaigagggagcccaggggctctgacattgctggcaccacctccaccct 

ccaggagcagattggctggaigaccaacaacocccccatccctglgggggaaatctacaagaggtggalcat 

octgggcctgaacaagattgtgaggatgiactoccccacctccatcclggacatcaggcagggccccaaggag 

cccttcagggactatgtggacaggitctacaagacoctgagggctgagcaggcctcccaggaggtgaagaact 

ggaigacagagaccctgclgglgcagaalgccaaccctgactgcaagaccatcctgaaggccctgggccctg 

ctgccaccctggaggagatgatgacagcctgcx:aggggglggggggccctgglcacaaggccagggtgc1g 

gctgaggccalgicccaggtgaccaactccgccaccalcaigalgcagaggggcaactlcaggaaccagag 

gaagacagtgaagtgcttcaactgtggcaaggtgggcoecattgccaagaactgtagggcccccaggaaga 

agggctgclggaaglglggcaaggagggccaccagatgaaggaclgcaatgagaggcaggccaacttcxjtg 

ggcaaaalctggccctcccacaagggcaggcctggcaacttcctccaglccaggcctgagcccacagcccct 

cccgaggagtcctlcaggtttggggaggagaagaccacxjcccagccagaagcaggagcccatlgacaagg 

agclglaccccctggcciccdgaggtccctgtttggcaacgacccctcclcccagtaaaalaaagcccgggca 

gat CSEQIDNO:29) 
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Old Transgene: 



aCMV tntronA OAO BOM 



NewTransgenes: 



GAG BGH 



GAG SPA 



GAG BOH 



Figure 3: Diagrammatic representation of the original HIV-I gag transgene and the series of 
new transgene constructions. 
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Figure 4: Modifications mede to the current adenovector backbone in the generation of the new 
vector. 
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Expt v^ 

(packaging exi.) 




£xpt«2 
(E3 vsE3*) 



El- 



E3- 



Figure 5: virus mixing experimenls to determine the effects of the addition made to the packaging signal 
region (Expt #1 ) and analysis of the effects ol the E3 gene on viral growth (Expt. #2). The red bars denote 
the region of modifications made to the El deletion. 
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Figure 6: Autoradiograph 0I viral DNA analysis following viral mixing experiments (expts. 4^1 
and #2) as detailed in the text. 
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Figure 7 A* hCMV-FLoag-bGHp A adenoveclors constructed within the "MRr backbone. El 
parallel end El antiparallel transgene orientation wtthln the E3- and £3^ backbones were 
constructed. 
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tCUV GAG EM. 



E3- 



E3- 



E3' 



ITS* I 



E3* 



Figure 7B: bCMV^FLgao-SP A adenovectors csonstrjciecJ within the "MRK" backbone. El 
parallel and El entiparaltel iransoene orierrtalion within the E3- and £3^ backbones were 
constructed. 
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Figure 7C: mCMV-FLgag-bGHpA adenoveciors constructed wtthin the •MRK* backbone- El 
parallel and E1 antlparallel transgene orientation within the E3* and backbones were 
constructed. 
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Piasmid mixing expt: (orientation) 




Figure 6A: Eflect of transgene orientation 
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Plasmid Mixing expt: (poiy Asignao 




Figure 8B: Effect of polyadenylation signal 
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Figure 9: Viral DNAfrom the four Adgag candidates a1 P5, following fisfEU digestion. 
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Figure 10: VIraJ DNA analysis of passage 11 and/or 12 of MRKHVE3. MRKAdSgag end 
MRKAd5gag(E3-). 
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Hind m 




Figure 11: Viral DNA analysis (H/ndltl digestion) of passage 6 MRKHVE3 and MRKAdSgag used 
to initiate the viral competition study. Last two ianes are passage 11 analysis of duplicate 
passages of the competition study (each virus at MOI 280 vp). 
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Hind III 




Figure 12: Viral DNA analysis by H/ndlll digestion on high passage numbers tor MRKAdSgag in 
serum containing media with collections made at specified times. The first lane shows the 1 Kb 
DNA size marker. The other lanes represent pre^plasmid control (digested with Pad and H/ndill). 
and MRKAdSgag virus continually passaged to P16. P10 ar>d P21 (serum containing media). 
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Figure . Serum ami-p24 Levels ai 3 Wks post i.m. immunizatioD of baJb'c mice (n=10) 
wiih Van'ing Doses of Several Adjgag constructs: (A) MRK^dSgag (through passage 5): 
(B) MRK.A,d5 E3* hCMV-FLgag-bGHpA; (C) MRK.Ad5 E3' hCMV-FLgag^SPA; (D) 
MRKAd5 E3* mCMN'-FLgag-bGHpA; (D) research Loi (293 cell-derived) of AdSHIV- 
Jgag; and (F) clinical lot (Ad5gagFN0001 ) of AdSHIV-lgag. Reponed are the geometric 
mean titers (GMT) for each cohort. 



1D' 



10* 



10' 



10* 



1000 - 



100 









i * 

! 

1 






♦ 


• 




SJ 


ft ft 


ft ft 

» • 


B 


c 





/ 



ft ft ft ft 
S S I k 



ft f ft ft 

i i I i 



16/144 



wo 02/22080 



PCTAJSOl/28861 



£^0^1(37465) 
AomH 1(37459) 

ApaUOim ' 
Apo U (35901) 
£coRI(35466) 
Smfl 1(34875) 
Xffla]04873) 
l/md m (34449) . 
5fna 1(32612) 
XfTU>I(32610) 
M'nd 10(31512) 
Apo U (30969) 
£coTll (29568) \ 
Apo U (28370) 
ApoUOZ20]) 
5mfl 1(27049) 
I (27047) 
EcoR I (26850) 
Sma] (26819) 
Xma 1(26817) 
Hind m (25853) 
>l;>olJ (25132) .\ 
Smc 3 (23439) : 
X»7W 1(23437) ' 
Apa U (22535) ' 
Apa U (22235)* 

i^HsU (21148) 
&imHiai087) 

5hioI(l9899) 
»iwl(l«97) 




LI (1534) 
>g>fl U (1699) 
iV»aU(1924) 
J6M 1(2783) 
' 5moI(2785) 

Xma 1(3465) 
5mfl 1 (3467) 
Xmfl 1 (3645) 
. 'Siiifll(3647) 
'•'i4pfllJ(4034) 
Mnd 111 (5766) 
Apa U (5940) 
J&m> 1 (6108) 
5ma 1(6110) 
>l;>flU{7641) 
M ndm (11090) 
Xnui 1(12588) 
5m0 1(12590) 
• ,W«din (13171) 
/find m (13246) 
JOruJ 1(13987) 
5wfl 1(13989) 
iV»U 05089) 
KM m (17844) 
J&R0|(18442) 
Smc 1(18444) 



17/144 



wo 02/22080 



PCT/USOl/28861 



c 

IT. 

< 




o o 



o 



o 

SD 



o 

17^ 



o e 



o 

\0 



. 18/144 



wo 02/22080 



PCT/USOl/28861 



b2 



C 
C 




o 
o 



D O 



o 

CM 



O 



ra 



o 



O 

o 



19/144 



wo 02/22080 PCT/USOl/28861 




20/144 



wo 02/22080 PCTAJSOl/28861 




21/144 



wo 02/22080 PCT/USOl/28861 




22/144 



wo 02/22080 



PCTAJSOl/28861 



£ 



JT 




o 



in 



to 



e 

r- 

CD 



00 



23/144 



wo 02/22080 



PCTAJSOl/28861 




24/144 



wo 02/22080 



PCTAJSOl/28861 




wo 02/22080 



PCT/USOl/28861 




26/144 



wo 02/22080 



PCTAJSOl/28861 



a: 
b: 



c 
c 

XT 




o 
in 



o 

CD 



e o e o o 
r* #o ^ if> \D 
in i/> tT) m in 



o 



in 



o 



27/144 




28/144 



wo 02/22080 



PCTAJSOl/28861 



cc 
\r 
c 

l: 
X 

10 

c 



c: 

i 




o 
o 



CD 



o — 



29/144 



wo 02/22080 



PCT/USOl/28861 




fi^i>trt ISM 

30/144 



wo 02/22080 PCT/USOl/28861 



e 



c 

T 




o 

fVJ 



o o o 
m V 



o 

CD 



o 
o 



31/144 



wo 02/22080 



PCT/USOl/28861 



ve. 

IE 
O 
AT* 




O O O O O D D 

O fH fM W 

r» n n ri 

CN) «>» N N IN r« w 



o 



32/144 



wo 02/22080 



PCTAJSOl/28861 



in 
< 




o 



in in 



ess 
m 

CM 



in 
m 



in 



33/144 



wo 02/22080 



PCTAJSOl/28861 



z 

er- 
ic 

i 




O 

in 

IN 



CD 



0\ 

in 



o o 

f-l cs 



4D 



IN 



O 



CD 



o o 

<M IN 



CN 



34/144 



wo 02/22080 PCTAJSOl/28861 



IN 
C 
ID 

£ 

T 
< 




N IN 



O 



O O O O O O O 

t>N. r» r» A CD CD CO 



00 



CD 



35/144 



wo 02/22080 



PCT/USOl/28861 



V» 

a 
£ 

D 

IC 
C 
in 

< 

I 




t ^ "5 ^ 





o o 

M M fH 



O 



CM 



CM 



.D e 

IN rs 



9s 



2 S 



36/144 



wo 02/22080 



PCT/USOl/28861 



cc 
\o 
c 
u 

X 

D: 
IC 

tr. 
in 

< 




37/144 



wo 02/22080 PCT/USOl/28861 




38/144 



wo 02/22080 



PCTAJSOl/28861 



c 
in 

A. 




GO 



o o ^ S 5 5 

f.. CO o> o 

V V « in in m 

r> f*» »** i*i «^ 



39/144 



wo 02/22080 



PCTAJSOl/28861 




o ooooooooDooooo eo 



40/144 



wo 02/22080 



PCTAJSOl/28861 



o 
e 

s: 
< 



41/144 



wo 02/22080 



PCT/USOl/28861 




Sfil Bglll 



FIGURE 16 
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AGAKT ACCATGGCCCCCATCTCCCCCATTGACACTC7CCCTGTGAAGCTGAAGCCTGGCATGGATGGCCCCAASG1GAA 
eoTiT MelAIoProIleSerProllcGluThrVolProVolLysLeulysProGlyMelAspGlyProLysVolLy 
1 10 20 • • 

GCAGTGGCCCCTGACTGAGGAGAAGATCAAGCCCCTGCTGGAAATCTGCACTGAGATGGAGAAGGAGGGCAAAAICT^^^ 
sGln7rpProLeuThrCluGlu^ysIleLysAloLeuVol6luIIeCysThrCli*«elGluLysGluGlyLyslleSerL 
30 40 50 

AGATTGGCCCCGAGAACCCCTACAACACCCCTGTGTnGCCATCAAGAAGAAGGACTCCACCAAGTGGAGGAACCTGGTG 
ysIleGlyProGluAsnProTyrAsnThrProVolPheAlolleLysLysLysAspSerThrLysTrpArgLysLeuVol 

GACnCAGGGAGCTGAACAAGACGACCCACGACTTClGGGAGGlGCAGClGGGCAICCCCCACCCCGCTGGCCTGAAGAA 
AspPheArgGluLeuAsnLysArgThrGlnAspPheTrpGluVolGlnLeuGlyllcProHisProAloGlyLeuLysLy 
80 90 100 

GAAGAAGlCTGTGACTGTGCTGGCTGTGGGGGATGCCTACnClCTGTGCCCCTGGATGAGGACTTCAGCAAGTACACTG 
slysLysSerVolThrVolLeuAloVolGlyAspAloTyrPhcSerVolProLcuAspGluAspPheArgLysTyrThrA 
110 120 130 

CCTTCACCA1CCCCTCCA1CAACAATGAGACCCCTGGCATCAGGTACCAGTACAATGTGCTGCCCCAGGGCTGGAAGGGC 
loPheTnrlleProSerlleAsnAsnGluThrProGlylleArQlyrGlnlyrAsnVolLeuProGlnGlyTrpLysGly 

140 150 

TCCCClGCCATCTTCCAClCCTCCAlGACCAAGATCCTGGAGCCCnCAGGAAGCAGMCCClGACATTGTGATClACCA 
SerProAlol lePheGlnSerSerMelThrLysl leLeuGluProPheArgLysGlnAsnProAspI leVol I leTyrGI 
160 170 180 

GTACATCGCTGCCC1C1ATGTGGGCTCTGACCTGGAGAT1GGGCAGCACAGGACCAAGATTGAGGAGCTGAGGCAGCACC 
nTyrMelAloAloleuTyrVolGlySerAspLeuGlulleGlyGlnHisArglhrLysllcGliCluLeuArgpinHisL 
190 200 210 

7GC1GAGGTGGGGCC1GACCACCCC1GACAAGAAGCACCAGAI^AG(XCCCCTTCCTG1GGATGGGCTATGAGCTG^^ 
euLeuAroTrpGlyLeuThrThrProAspLysLysHisGlnLysGluProProPheLcuTrpUelGlyTyrGluleuHis 

220 230 

CCOGACAAGTGGAClGlGGAGCCCAllCTCCTGCClGAGAAGGACTCCTaACTGTGAAlGACATCCACAC 
ProAspLysTrpThrVo IGltfro 1 1 eVoJLeuProG I ul ysAspSerTrpThrVo I AsnAspI I eG I nLysLeuVo IC I 
240 250 260 

CAAGCTDAACTGGGCCICCtAAAlClACCCTGGCAlCAACCTGAGGCAGCTGTGCAACClGCTGAGGGGCACCAAGGC^ 
yLysLeuAsnlrpAloSerGlnlleTyrProGlylleLysVolArgG^nleuCysLysLeuLeuArgGlyThrLysMoL 



HGURE 17A 
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7GACTGAGC1GAlCCCCC7CACTGAGGAGGCTGACCT(»AGClGGClGAG^ACmAGATCCTGAAGGAGCCTGTGCAT 
EiiThrGluVol I leProLeulhrGluGluAloGluleuGtuLeuAlaGluAsnArgClul leLeuLysGluProVolHis 

300 310 

CCGGTGTACTATCACCCCICCAAGGACCTGATTGCTGADAICCAGAAGCAGGGCCAGGGCCAGTGGACCTACCAAATCTA 
GiyVolTyrTyrAspProSerLysAspLeulleAloGluIleGinLysGlnGlyCinGlyGlnTrpThrTyrGlnlleTy 
320 330 340 

CCAGGAGCCCTTCAAGAACCTGAAGACTGGCAAGTATGCCACGATGAGGGGGGCCCACACCAATGATGTGAAGCAGCTGA 
rGlnGluProPheLysAsnleuLysThrGlyLysTyrAloArgMelArgGlyAloHisThrAsnAspVollysGlnLeuT 
350 3B0 370 

Ci:-AGGCTGTGCAGAAGATCACCACTGAGTCCATTGTGAlCTCGGGCAAGACCCCCAAGTTCAAGCTGaCATCCAGA« 
hrGluAloVolGlnLys]leThrThrGlu5erlleVollleTrpGlyLysThrProLysPheLysLeuProlleGlnLys 

380 390 

GAGACCTGGGAGACCTGGTGGACTGAGUCTGGCAGGCCACCTGGATCCClCAGTGGGAGTnGTGAACACCCCCCCCCT 
GluThrTrpGluThrTrpTrpThrGluTyrTrpGlnAlolhrTrplleProGluTrpGluPheVolAsnThrProProLe 
400 410 420 

GCTGAAGCTGIGGIACCAGCTGGAGAAGGAGCCCATIGIGGGGGCTGAGACCTTCTATGIGGCTGGGGCTGCCA^CAGGG 
uVolLysLeuTrpTyrClnLeuGluLysGluProIleVolGlyAloGluThrPheTyrVoJAIaGlyAloAloAsnArgG 
430 440 450 

AGACCAAGCTGGGCAACGCTCGCTATGTGACCAACAGGCGCAGGCAGAAOGICGTGACCCTGACTGACAC^ 
luThrLysLeuGlylysAloGlyTyrVolThrAsnArgGlyArgGlnLysVolVolThrLeuThrAspThrThrAsnGIn 

460 470 

AAGACTGCCC1CCAGGCCATCTACC16GCCCTCCAGGACTCTGGCCTGGAGGTGACA1TGTGACTGCCTCCCAGTATGC 
LysThrAloLeuGlnAloIleTyrLeuAloLeuGlnAspSerGlyLeuGluVolAsnlleVolThrAloSerGlnTyrAI 
460 490 500 

CC1GGCCATCATCCAGGCCCAGCCTGATCACTCTGACTCT6AGCTGGTGAACCAGATCATT6AGCAGCTGA1CAAGAAGG 
oLeuGlyl lei leGlnAloGlnProAspGlnSerGluSerGluLeuVolAsnGlnl iel leGli£lnleuI leLysLysG 
510 520 530 

/ICAAIX^TGTACCTCGCCTGGGTGCCTGCCCACAAGOGCATTGGGGGCAATGM^ 
IvLysVolTyrLeuAioTrpVolProAloHisLysGlylieGlyGlyAsnGluGlnVolAspLysLeuVolSerAloGly 
540 550 

A1CAGGAAGGTGCTGTTCCTGGATGGCATTCACAAGCCCCAGGATCAGCATGAGAAG1ACCACTCCAACTGGAGGGCTAT 
lleAroLysVolLeuPheLeuAspGlyllcAspLysAloGlnAspGluHisGluLysTyrHisSerAsnTrpArgAloMe • 
560 570 580 



FIGURE 17B 
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GGCCTCTGACTlCMCCTGCCCCCTGT«;T(XCTAAGGAGATTGiaCClCCTCTGACAAGTGCCAGCTGAAGGGGGAGG 
tAloSerAspPheAsnLeuProProVolVolAloLysGlulleVolAloSerCysAspLysCysGlnLeuLysGlyGluA 
590 600 610 

CCATGCATGGGCAGCTGGACTGCICCaTGGCATCTCGCAGCTGGCCTGCACCCACCTGGACGGCAAGGT^^^ 
l(*lelHisG I yC InVo I AspCysSerProG 1 y I leTrpG I nLeuA I oCysThrHi sLeiC 1 «X5 1 yLysVolIULeuVo I 

620 630 

CCTCTGCATGTG(^TIXGGCTACAnGAMK:TGAGGTGATCCCTGCTGAGACAGGCCAGGAGACTGCCTAC7TCCTGC^ 
AloVolHisVolAloSerGlyTyrlleGIuAloGluVolIleProAloGluThrGlyClnGluThrAIoTyrPheLeuLc 
640 650 650 

GAAGClGGCTGGCAGGTGGCCTGTGAAGACCATCCACACTGCCAATGGCTCCAACnCACTGGGGCCACAGTGAGGGCTG 
uLysLeuAlOGIyArQlrpProVolLysThrlieHisThrAloAsnGlySerAsnPheThrGlyAloThrVolArgAloA 
' 670 680 690 

CCTGCTGGTGGCCTCGCATCAACCAGGAGinGGCAlCCCCTACAACCCCCAGTCCCAGGGGGTGGTGGCCTCCAlGAAC 
I oCysT rpTrpAl oG I y 1 1 eLysG I nG I uPheG I y IleProTyrAsnProG InSerG I nGI yVo I Vo I AhSerMelAsn 

700 710 

AAGGAGCTGAAGAAGAICATTGGGCAGGTGAGGGACCACCCTGAGCACCTGAAGACAGCTGTGCAGATGGCTGTGTTCAT 
LysGluLcuLysLysllelleGlyGlnVolArgAspGlnAloGluHisLeuLysThrAloVolGlnMelAloVolPhell 
720 730 740 

CCACAACTTCAAGAGGAAGGGGGGCATCGGGGGCTAC1CCGCTGGGGAGAGGATTG7GGACATCATTGCCACAGACATCC 
eHisAsnPheLysArgLysGlyClylleGlyGlyTyrSerAloGlyGluArglleVolAspllellcAloThrAspIleG 
750 760 770 

AGACCAAGGAGCTCCAGAAGCAGATCACCAAGAlCCAGAACTTCAGGGTGTACTACAGGGACTCCAGGAACCaXTGTC^ 
InThrLysGluLeuGlnLysGlnlleThrLysIleGlnAsnPheArgVolTyrTyrArgAspSerArgAsnProLeuTrp 

' 780 790 

AAGGGCCCTGCCAAGCTGCIGTGGAAGGGGGAHXJGGCTGTGGTGATCCAOJACAACTCTGACATCAAGGTGGTGCC^ 
LysGlyProAloLysLeuleuTrpLysGlyGluGlyAloVolVollleGlnAspAsnScrAsplleLysVolVolProAr 

800 81D 820 

GAGGAAGGCCAAGATCATCAGGGACIATGGCAACCAGATGGCTGGGGATGACTCIGIGGCCICC^^ 
QArgLysAloLysMellfiArgAspTyrGlylysClnMelAloGlyAspAspCysVfllAloSerArspinAspGlutepx 

B30 840 850 



AAACCCCGGGCAGATC; (SEQ id NQ: 3) 

XX Bgnl'" (SEQIDN0:4) 
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wr 

OPT 

wr 

OPT 

wr 

OPT 

wr 

OPT 

WT 
OPT 

VfT . 
OPT 

WT 
OPT 

WT 
OPT 

WT 
OPT 

HT 

OPT 



ATG GGT GGC AAG T6G TCA AM CGT AGT 6TG CCT GGA T66 TCT -42 

III II III III III II II I III I'l 11 III II 
ATG GGC GGC AAG TGG TCC AAG AGG TCC GTG CCC GGC TGG TCC 
HGGKWSKRSVP6WS -14 

ACT GTA AGG' gAA AGA ATG AGA CGA 6CT GAG CCA 6CA 6CA GAT -B4 

II II III II II III II I 11 III II II II II 
ACC GTTG AGG Gt^ AGG ATG AGG AGG GCC GAG CCC GCC GCC GAG 

TVRERHRRAEPAAD -28 

AGG GTG AGA CGA ACT GAG CCA GCA GCA GTA GGG GTG GGA GGA >126 

III III II I II III II II II II II III II II 
AGG GTG AGG AGG ACC GAG CCC GCC GCC GTG GGC GTG GGC GCC 

RVRRTEPAAVGV6A -42 

GTA TCT CGA GAG CTG GAA AAA CAT GGA GCA ATC ACA AGT AGC -168 

II II I III 111 II II II II II III II I 
GTG TCC AGG GAC CTG GAG AAG CAC GGC GCC ATC ACC TCC TCC 
VSRDLEKHGAITSS -56 

AAT ACA GCA GCT ACC AAT GCT GAT TGT GCC TGG CTA GAA GCA -210 

II II II II III II II II II 111 111 II II II 
AAC ACC GCC GCC ACC AAC GCC GAC TGC- GCC TGG CTG GAG GCC 
NTAATNADCAWLEA -70. 

CAA GAG GAT GAG GAA GTG GGT TIT CCA GTC AGA CCT CAG GTA -252 

II III II III II III II II II II II II III II 
CAG GAG GAC GAG GAG GTG GGC TTC CCC GTG AGG CCC CAG GTG 
QEDEEVGFPVRPQV -84 

CCT TTA AGA CCA ATG ACT TAC AAG GGA GCT GTA GAT CTT AGC -294 

II I II II III II III III II II II II II I 
CCC CTG AGG CCC ATG ACC TAC AAG GGC GCC GTG GAC CTG TCC 

PLRPMTYKGAVDLS -98 

CAC TTT TTA AAA GAA AAG GGG GGA CTG GAA GGG CTA ATT CAC -336 

III II 1 II II III II II III II II II II III 
CAC TTC CTG AAG GAG AAG GGC GGC CTG GAG GGC CTG ATC CAC 

HFLKEKGGLEGLIH -112 

TCA CAG AAA AGA CAA GAT ATC CTT GAT CTG TGG GTC TAC CAC -378 

II III II II II II III II II III III II III III 
TCC CAG AAG AGG CAG GAC ATC CTG GAC CTG TGG GTG TAC CAC 
SQKRQDI LDL WV YH -126 

ACA CAA GGC TAC TTC CCT GAT TGG CAG AAC TAC ACA CCA GGG -420 

II II III 111 III II II III III III 111 II II II 
ACC CAG GGC TAC TTC CCC GAC TGG CAG AAC TAC ACC CCC GGC 
TQGYFPDWONYTPG -140 
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CCA 6SA ATC AGA TTT CCA TTG ACC TTT GBA TGG TQC TTC AAG -462 

II II III II II II II III II II III III III III 
CCC BGC ATC AGG TTC CCC CTG ACC TTC GGC TGG TGC TTC AAG 
PGIRFPLTFGWCFK -154 

CTA GTA CCA GTT GAG CCA 6AA AAG GTA 6AA GAG GCC AAT GAA -504 

II II II 11 III II II III II II 111 III It II 
aG GTG CCC GTG GAG CCC GAG AAG GTG GAG GAG GCC AAC GAG 
LVPVEPEKVEEANE -168 

GGA GAG AAC AAC TGC TTG TIA CAC CCT ATG AGC CAG CAT G6G -546 

II III III III III II I III II III I III II II 
GGC GAG AAC AAC TGC CTG CTG CAC CCC ATG TCC CAG CAC GGC 
GEN NCLLHPMSQHG -182 

ATA GAG GAC CCG GAG AAG GAA GTG TTA GAG TGG AGG TH GAC -588 

II III III II III III II III I I III III III I I II 
ATC GAG GAC CCC GAG AAG GAG GTG CTG GAG TGG AGG TTC GAC 
lEDPEKEVLEWRFD -196 

AGC AAG CTA GCA TTT CAT CAC GTG GCC C6A GAG CTG CAT CCG -630 

I III II II II II III III III I III III II II 
TCC AAG CTG GCC TTC CAC CAC GTG GCC AGG GAG CTG CAC CCC 

SKLAFHHVARELHP -210 

GAG TAC TAC AAG GAC TGC TSA (SEQ ID N0:30) -651 

6A6 TAC TAC AAG GAC TGC TAA (contained within SEQIDN0:9) 
E Y Y K D C (SEQ ID NO: 10) -216 
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HIV'Pol'inacI (opt) 



MRKptffiSI (Pacrpl>UpacM50)^MVm]WOHpA(str.) 




KanR 



Bgl 11 /phosphatase 



Bd11 1(i5Dlale pol gene fragment) 




Ligation 

Screen for correct orientation of pol 



HlV'Pd-inaci (opt) 




C\^^ disBSt pre-plasmid 



Digest with Pad and BstllOT I (or BstZIO? I) 



Bacterial Homotogous Recombination (BJ51 63) 
XL-1 blue (plasmid amplification) 



HIV'Pot-inaci (opt) 



Figure 1: Construction of pre-plasmid T)MRKAd5por 
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BgM 1 /phosphatase Bgtl 1 (isolate nef gene fragment) 



Ligation 

Screen for correct orientation of nef, 
















A06 4 














Clai digest pre-plasmid 



A« lliniMl39) 

Digest with Pad and Bst1l07 I (orBstZ107 I) 



Bacteria) Homologous Recombination (BJ5163) 
XL-1 blue (plasmid amplification) 



Figure 2: Construction of pre-plasmid "pMRKAdSnef. 
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MRKAd5po)MER1062 
(MRKAdS Pre-Adenovira] Vecior Containing the lA opt pol Cocbng Region) 



1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 
GTAGTAGTTA TTATATGGAA TAAAACCTAA CTTCGGTTAT ACTATTACTC 

51 GGGGTGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 
CCCCACCTCA AACACTCCAC CCCGCCCCCC ACCCTTGCCC CGCCCACTGC 

101 TAGTAGTGTG 6CGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 
ATCATCACAC CGCCTTCACA CTACAACGTT CACACCGCCT TGTGTACATT 

151 GCGACGGATG TGGCAAAA6T GACGTTTTTG GTGTGCGCCG GTGTACACAG 
CGCTGCCTAC ACC6TTTTCA CT6CAAAAAC CACACGCGGC CACATGTGTC 

201 GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 
CTTCACTGTT AAAAGCGCGC CAAAATCCGC CTACAACATC ATTTAAACCC 

251 CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
GCATTGGCTC ATTCTAAACC GGTAAAAGCG CCCTTTTGAC TTATTCTCCT 

3 01 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 
TCACTTTAGA CTTATTAAAA CACAATGAGT ATCGCGCATT ATAAACAGAT 

351 GGGCCGCGGG GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT 
CCCGGCGCCC CTGAAACTGG CAAATGCACC TCTGAGCGGG TCCACAAAAA 

401 CTCAGGTGTT TTCCGCGTTC CGGGTCAT^G TTGGCGTTTT ATTATTATAG 
GAGTCCACAA AAGGCGCAAG GCCCAGTTTC AACCGCAAAA TAATAATATC 

451 GCGGCCGCGA TCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT 
CGCCGGCGCT AGGTAACGTA TGCAACATAG GTATAGTATT ATACATGTAA 

501 TATATTGGCT CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC 
ATATAACCGA GTACAGGTTG TAATGGCGGT ACAACTGTAA CTAATAACTG 

551 TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
ATCAATAATT ATCATTAGTT AATGCCCCAG TAATCAAGTA TC6GGTATAT 

601 TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG 
ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC 

651 CCCAACGACC CCCGCCCATT GACGTCAATA AT6ACGTATG TTCCCATAGT 
GGGTTQCTGG .GGGCGGGTAA CTGCAGTTAT TACTGCATAC AAGGGTATCA 

701 AACGCCAATA G6GACTTTCC ATT6ACGTCA ATGGGTGGAG TATTTACG6T 
. TTGCGGTTAT CCCTGAAAGG TAACTGCA6T TACCCACCTC ATAAATGCCA 

751 AAACTGCCCA CrTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 
TTTGACGGGT GAACCGTCAT GTAGTTCACA TAGTATACGG TTCATGCGGG 

601 CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA 
GGATAACTGC AGTTACTGCC ATTTACCGGG CGGACCGTAA TACGGGTCAT 

B51 CATGACCTTA TGGCACTTTC CTACTTQGCA GTACATCTAC GTATTAGTCA 
GTACTGGAAT ACCCTGAAAG GATQAACCGT CATGTAGATG CATAATCAGT 




55/144 



wo 02/22080 



PCT/USOl/28861 



901 TCGCTATTAC dPtCTGATG CGCTTTTGGC AGTACATCAA TGGGCd^feA 
AGCGATAATG GTACCACTAC GCCAAAACCG TCATGTAGTT ACCCGCACCT 

951 TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 
ATCGCCAAAC TGAGTGCCCC TAAAGGTTCA 6AGGTGGGGT AACTGCAGTT 

1001 T6G6AGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 
ACCCTCAAAC AAAACCGTGG TTTTAGTTGC CCTGAAAGGT TTTACAGCAT 

1051 ACAACTCCGC CCCATTGACG CAAATGQGCG GTAGGCQTGT ACGGTGGGAG 
TGTTGAG6CG G6GTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC 

1101 GTCTATATAA GCAGAGCTCG TTTA6TGAAC CGTCAGATCG CCTGGAGACG 
CAGATATATT CGTCTCGAGC AAATCACTTG GCAGTCTAGC GGACCTCTGC 

1151 CCATCCACGC TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC 
GGTAGGTGCG ACAAAACTGG AGGTATCTTC TGTGGCCCTG GCTAGGTCGG 

1201 TCCGCGGCCG GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT 
AGGCGCCGGC CCTTGCCACG TAACCTTGCG CCTAAGGGGC ACGGTTCTCA 

1251 GAGATCTACC ATGGCCCCCA TCTCCCCCAT TGAGACTGTG CCT6TGAAGC 
CTCTAGATGG TACCGGGGGT AGAGGGGGTA ACTCTGACAC GGACACTTCG 

1301 TGAAGCCTGG CATGGATGGC CCCAAGGTGA AGCAGTGGCC CCTGACTGAG 
ACTTCGGACC GTACCTACCG GGGTTCCACT TCGTCACCGG GGACTGACTC 

1351 GAGAAGATCA AGGCCCTGGT GGAAATCTGC ACTGAGATGG AGAAGGAGGG 
CTCTTCTAGT TCCGGGACCA CCTTTAGACG TGACTCTACC TCTTCCTCCC 

1401 CAAAATCTCC AAGATTGGCC CCGAGAACCC CTACAACACC CCTGTGTTTG 
GTTTTAGAGG TTCTAACCGG GGCTCTTGGG GATGTTGTGG GGACACAAAC 

1451 CCATCAAGAA GAAGGACTCC ACCAAGTGGA GGAAGCTGGT GGACTTCAGG 
GGTAGTTCTT CTTCCT6AGG TGGTTCACCT CCTTCGACCA CCTGAAGTCC 

1501 GAGCTGAACA AGAGGACCCA GGACTTCTGG GAGGTGCAGC TGGGCATCCC 
CTCGACTTGT TCTCCTGGGT CCTGAAGACC CTCCACGTCG ACCCGTAGGG 

1551 CCACCCCGCT GGCCTGAAGA AGAAGAAGTC TGTGACTGTG CTGGCTGTGG 
GGTGGGGCQA CCQGACTTCT TCTTCTTCAG ACACTGACAC GACCGACACC 

1601 GGGATGCCTA CTTCTCTGTG CCCCTGGATG AGGACTTCAG GAAGTACACT 
CCCTACGGAT GAAGAGACAC GGGGACCTAC TCCTGAAGTC CTTCATGT6A 

1651 GCCTTCACCA TCCCCTCCAT CAACAATGAG ACCCCT6GCA TCAGGTACCA 
CGGAAGTGGT A6GGGAQGTA GTTGTTACTC TGGGGACCGT AGTCCATGGT 

1701 GTACAATGTG CTGCCCCAGG GCTGGAAGGG CTCCCCTGCC ATCTTCCAGT 
CATGTTACAC GACGGGGTCC CGACCTTCCC GAGGGGACGG TAGAAGGTCA 

1751 CCTCCATGAC CAAGATCCTG GAGCCCTTCA GGAAGCAGAA CCCTGACATT 
GGAGGTACTG GTTCTAGGAC CTCGGGAAGT CCTTCGTCTT GGGACTGTAA 

1801 GTGATCTACC AGTACATGGC TGCCCTGTAT GTGGGCTCTG ACCTGGAGAT 
CACTAGATGG TCATGTACCG ACCCGACATA CACCXTGAGAC TGGACCTCTA 
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1851 TGGGCAGCAC ;>^^CCAAGA TTGAGGAGCT GAGGCAGCAC CTGCTG|«T 
ACCCGTCGTG TCCTGGTTCT AACTCCTCGA CTCCGTCGTG GACGACTCCA 

1901 GG6GCCTGAC CACCCCTGAC AAGAAGCACC A6AA6GAGCC CCCCTTCCTG 
CCCCGGACTG GTGGGGACTG TTCTTCGTGG TCTTCCTCGG GGGGAAGGAC 

1951 TGGATG6GCT ATGAGCTGCA CCCCGACAAG TGGACTGTGC AGCCCATTGT 
ACCTACCCGA TACTC6ACGT GGGGCTGTTC ACCTGACACG TCGGGTAACA 

2001 GCTGCCTGA6 AAGGACTCCT GGACTGTGAA TGACATCCAG AAGCTGGTGG 
CGACGGACTC TTCCTGAGGA CCTGACACTT ACTGTAGGTC TTCGACCACC 

2051 GCAAGCTGAA CTGGGCCTCC CAAATCTACC CTGGCATCAA GGTGAGGCAG 
CGTTCGACTT GACCCGGAGG GTTTAGATGG GACCGTAGTT CCACTCCGTC 

2101 CTGTGCAAGC TGCTGAGGGG CACCAACGCC CTGACTGAGG TGATCCCCCT 
GACACGTTCQ ACGACTCCCC GTGGTTCCGG GACTGACTCC ACTAGGGGGA 

2151 GACTGAGGAG GCTGAGCTGG AGCTGGCTGA GAACA6G6AG ATCCTGAAGG 
CTGACTCCTC CGACTCGACC TCGACCGACT CTTGTCCCTC TAGGACTTCC 

2201 AGCCTGTGCA TGGGGTGTAC TATGACCCCT CCAAGGACCT GATTGCTGAG 
TCGGACACGT ACCCCACATG ATACTGGGGA GGTTCCTGGA CTAACGACTC 

2251 ATCCAGAAGC AGGGCCAGGG CCAGTGGACC TACCAAATCT ACCAGGAGCC 
TAGGTCTTCG TCCCGGTCCC GGTCACCTGG ATGGTTTAGA TGGTCCTCGG 

2301 CTTCAAGAAC CTGAAGACTG GCAAGTATGC CAGGATGAGG GGGGCCCACA 
GAAGTTCTTG GACTTCTGAC CGTTCATACG GTCCTACTCC CCCCGGGTGT 

2351 CCAATGATGT GAAGCAGCTG ACTGAGGCTG TGCAGAAGAT CACCACTGAG 
GGTTACTACA' CTTCGTCGAC TGACTCCGAC ACCTCTTCTA GTGGTGACTC 

2401 TCCATTGTGA TCTGGGGCAA gacccccaag ttcaagctgc ccatccagaa 
aggtaacact agaccccgtt ctgggggttc aagttcgacg ggtaggtctt 

2451 ggagacctgg gagacctggt ggactgagta ctggcaggcc acctggatcc 
cctctggacc ctctggacca cctgactcat gaccgtccgg tggacctagg 

2501 ctgagtggga gtttgtgaac accccccccc tggtgaagct gtggtaccag 
gactcaccct caaacacttg tggggggggg accacttcga caccatggtc 

2551 ctggagaagg agcccattgt gggggctgag accttctatg tggctggggc 
gacctcttcc tcgggtaaca cccccgactc tggaagatac accgaccccg 

2601 tgccaacagg gagaccaagc tgggcaaggc tggctatgtg accaacaggg 
acggttgtcc ctctggttcg acccgttccg acc6atacac tggtt6tccc 

2651 gcaggcagaa ggtggtgacc ctgactgaca ccaccaacca 6aagactgcc 

CGTCCGTCTT CCACCACTGG GACTGACTGT GGTGGTTGGT CTTCTGACGG 

27 01 CTCCAGGCCA TCTACCTGGC CCTCCAGGAC TCTGGCCTGG AGGT6AACAT 
GAGGTCC6GT A6ATGGACCG GGAGGTCCTG AGACCGGACC TCCACTTGTA 

2*751 TGTGACTGCC TCCCAGTATG CCCTGGGCAT CATCCAGGCC CAGCCTGATC 
ACACTGACGG AGGGTCATAC GGGACCCGTA GTAGGTCCGG GTCGGACTAG 
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2001 AGTCTGAGTC "H^CTGCTG AACCAGATCA TTGAGCAGCT GATCAA^^G 
TCAGACTCAG ACTCGACCAC TTGGTCTAGT AACTCGTCGA CTAGTtWc 

2851 GA6AAGGTGT ACCT66CCTG G6TGCCTGCC CACAAGGGCA TT6GGGGCAA 
CTCTTCCACA TGGACCGGAC CCACGGACG6 GTGTTCCCGT AACCCCCGTT 

2901 T6AGCAGGTG GACAAGCTGG TGTCTGCTGG CATCAG6AAG GT6CTGTTCC 
ACTCGTCCAC CTGTTCGACC ACAGACGACC GTAGTCCTTC CACGACAAGG 

2951 TGGATGGCAT TGACAAGGCC CAGGATGAGC ATGAGAAGTA CCACTCCAAC 
ACCTACCGTA ACTGTTCCGG GTCCTACTCG TACTCTTCAT GGTGAGGTTG 

3001 TGGAGGGCTA TGGCCTCTGA CTTCAACCTG CCCCCTGTGG TGGCTAAGGA 
ACCTCCCGAT ACCGGAGACT GAAGTTGGAC GGGGGACACC ACCGATTCCT 

3051 GATTGTGGCC TCCTGTGACA AGTGCCAGCT GAAGGGGGAG GCCATGCATG 
CTAACACCGG AGGACACTGT TCACGGTCGA CTTCCCCCTC CGGTACGTAC 

3101 GGCAGGTGGA CTGCTCCCCT GGCATCTGGC AGCTGGCCTG CACCCACCTG 
CCGTCCACCT GACGAGGGGA CCGTAGACCG TCGACCGGAC GTGGGTGGAC 

3151 GAGGGCAAGG TGATCCTGGT GGCTGTGCAT GTGGCCTCCG GCTACATTGA 
CTCCCGTTCC ACTAGGACCA CCGACACGTA CACCGGAGGC C6ATGTAACT 

3201 GGCTGAGGTG ATCCCTGCTG AGACAGGCCA GGAGACTGCC TACTTCCTGC 
CCGACTCCAC TAGGGACGAC TCTGTCCGGT CCTCTGACGG ATGAAGGACG 

3251 TGAAGCTGGC TGGCAGGTGG CCTGTGAAGA CCATCCACAC TGCCAATGGC 
ACTTCGACCG ACCGTCCACC GGACACTTCT GGTAGGTGTG ACGGTTACCG 

3301 TCCAACTTCA CTGGGGCCAC AGTGAGGGCT GCCTGCT66T 6GGCTGGCAT 
AGGTTGAAGT GACCCCGGTG TCACTCCCGA C6GACGACCA CCCGACCGTA 

3351 CAAGCAGGAG TTTGGCATCC CCTACAACCC CCAGTCCCAG GGGGTGGTGG 
GTTCGTCCTC AAACCGTAGG GGATGTTGGG GGTCAGGGTC CCCCACCACC 

3401 CCTCCATGAA CAAGGAGCTG AAGAAGATCA TTGGGCAGGT GAGGGACCAG 
GGAGGTACTT GTTCCTCGAC TTCTTCTAGT AACCCGTCCA CTCCCTGGTC 

3451 GCTGA6CACC TGAAGACAGC TGTGCAGATG GCTGTGTTCA TCCACAACTT 
CGACTCGTG6 ACTTCrGTCG ACACGTCTAC CGACACAAGT AGCTGTTGAA 

3501 CAAGAGGAAG GGGGGCATCG GGGGCTACTC CGCTGGGGAG AGGATTGTGG 
GTTCTCCTTC CCCCCGTAGC CCCCGAT6AG 6C6ACCCCTC TCCTAACACC 

3551 ACATCATTGC CACAGACATC CAGACCAAGG AGCTCCAGAA GCAGATCACC 
TGTAGTAACG GTGTCTGTAG GTCTGGTTCC TCGAGGTCTT C6TCTAGTGG 

3601 AAGATCCAGA ACTTCAGGGT GTACTACAGG GACTCCAGGA ACCCCCTGTG 
TTCTAGGTCT TGAAGTCCCA CATGATGTCC CTGAGGTCCT TGGGGGACAC 

3651 GAAGG6CCCT GCCAAGCTGC TGTGGAAGGG GGAGGGGGCT GTGGTGATCC 
CTTCCCGGGA CGGTTCGAC6 ACACCTTCCC CCTCCCCCGA- CACCACTAGG 

3"701 AGGACAACTC TGACATCAAG GTGGTGCCCA GGAGGAAGGC CAAGATCATC 
TCCTGTTGAG ACTGTAGTTC CACCACGGGT CCTCCTTCCG GTTCTAGTAG 
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3751 AGGGACTAT<5 □•rGCAGAT GGCTGGGGAT GACTGTGTGG'T:CTCCAjjj|A 
TCCCTGATAC otRkTGTCTA CCGACCCCTA CTGACACACC GGAGGoIRt 

3801 GGATGAGGAC TAAAGCCCGG GCAGATCTGC TGTGCCTTCT AGTTGCCAGC 
CCTACTCCTG ATTTCGGGCC CGTCTAGACG ACACGGAAGA TCAACGGTCG 

3851 CATCTGTTGT TTGCCCCTCC CCCGTGCCTT CCTTGACCCT GGAA6GTGCC 
GTAGACAACA AACGGG6A66 GGGCACG6AA GGAACTGGGA CCTTCCACGG 

3901 ACTCCCACTG TCCTTTCCTA ATAAAATGAG GAAATTGCAT CGCATTGTCT 
TGAGGGTGAC AGGAAAGGAT TATTTTACTC CTTTAACGTA 6CGTAACAGA 

3951 GAGTAGGTGT CATTCTATTC TGGGGGGTGG GGTGGGGCAG GACAGCAAGG 
CTCATCCACA GTAAGATAAG ACCCCCCACC CCACCCCGTC CTGTCGTTCC 

4001 GGGAGGATTG GGAAGACAAT AGCAGGCATG CTGGGGATGC GGTGGGCTCT 
CCCTCCTAAC CCTTCTGTTA TCGTCCGTAC GACCCCTACG CCACCCGAGA 

4051 ATGGCCGATC GGCGCGCCGT ACTGAAATGT GTGGGCGTGG CXTAAGGGTG 
TACCGGCTAG CCGCGCGGCA TGACTTTACA CACCCGCACC GAATTCCCAC 

4101 GGAAAG7ATA TATAA6GTGG GGGTCTTATG TAGTTTTGTA TCTGTTTTGC 
CCTTTCTTAT ATATTCCACC CCCAGAATAC ATCAAAACAT AGACAAAACG 

4151 AGCAGCCGCC GCCGCCATGA GCACCAACTC GTTTGATGGA AGCATTGTGA 
TCGTCGGCGG CGGCGGTACT CGTGGTTGAG CAAACTACCT TCGTAACACT 

42 01 GCTCATATTT GACAACGCGC ATGCCCCXJIT GGGCCGGGGT GCGTCAGAAT 
.CGAGTATAAA CTGTTGCGCG TACGGGSGTA CCCGGCCCCA CGCAGTCTTA 

4251 GTGATGGGCT CCAGCATTGA TGGTCGCCCC GTCCTGCCCG CAAACTCTAC 
CACTACCCGA GGTCGTAACT ACCAGCQG6G CAGGACGGGC GTTTGAGATG 

4301 TACCTTGACC TACGAGACCG TGTCTGGAAC GCCGTTGGAG ACTGCAGCCT 
ATGGAACTGG ATGCTCTGGC ACAGACCTTG CGGCAACCTC TGACGTCGGA 

4351 CCGCCGCCGC TTCAGCCGCT GCAGCCACCG CCCGCGGGAT TGTGACTGAC 
GGCGGCGGCG AAGTCGGCGA CGTCGGTGGC GGGCGCCCTA ACACTGACTG 

4401 TTTGCTTTCC TGAGCCCGCT TGCAAACAGT GCAGCTTCCC GTTCATCCGC 
AAACGAAAGG ACTCGGGCGA ACGTTTGTCA CGTCGAAGGG CAAGTAGGCG 

4 451 CCGCGATGAC AAGTTGACGG CTCTTTTGGC ACAATTGGAT TCTTTGACCC 
GGCGCTACTG TTCAACTGCX GAGAAAACCG TGTTAACCTA A6AAACTGGG 

4501 GG6AACTTAA TGTCGTTTCT CAGCAGCTGT TG6ATCTGCG CCAGCAGGTT 
CCCTTGAATT ACAGCAAAGA GTCGTCGACA ACCTAGACGC GGTCGTCCAA 

4551 TCTGCCCTGA AGGCTTCCTC CCCTCCCAAT GCGGTTTAAA ACATAAATAA 
AGACGGGACT TCCGAAGGAG GGGAGGQTTA CGCCAAATTT TGTATTTATT 

4501 AAAACCAGAC TCTGTTTGGA TTTGGATCAA GCAAGTGTCT TGCTGTCTTT 
TTTTGGTCTG AGACAAACCT AAACCTA6TT CGTTCACAGA ACGACAGAAA 

4651 ATTTAGGGGT TTTGCGCGCG CGGTAGGCCC GGGACCAGCG GTCTCGGTCG 
TAAATCCCCA AAACGCGCGC GCCATCCGGG CCCTGGTCGC CAGAGCCAGC 
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4701 



TTGAGGOTCC 
AACTCCCAGG 



fTATTTT TTCCAQQACG TGGTAAA<3R3'aH5^CTC¥i 
lATAAAA AABGTCCTGC ACCATTTCCA CTGAGA^ 




4751 GTTCAGATAC ATGGGCATAA GCCCGTCTCT GGGGTGGAGG TAGCACCACT 
CAAGTCTATG TACCCGTATT COGGCAGAGA CCCCACCTCC ATC6TGGT6A 

4801 GCAGAGCTTC ATGCTGCGGG GTGGTGTTGT AGATGATCCA GTCGTAGCAG 
CGTCTCGAAG TACGACGCCC CACCACAACA TCTACTAGGT CAGCATCGTC 

4851 GAGCCCTCGG CGTGGTGCCT AAAAAIX3TCT TTCAGTAGCA AGCTGATTGC 
CTCGCGACCC GCACCACGGA TTTTTACAGA AAGTCATCGT TCGACTAACG 

4901 CAGGGGCAGG CCCTTGGTGT AAGTGTTTAC AAAGCGGTTA AGCTGGGATG 
GTCCCCGTCC GGGAACCACA TTCACAAAT6 TTTCGCCAAT TCGACCCTAC 

4951 GGTGCATACG TGGGGATATG AGATGCATCT TGGACTGTAT TTTTAGGTTG 
CCACGTATGC ACCCCTATAC TCTACGTAGA ACCTGACATA AAAATCCAAC 

5001 GCTATGTTCC CAGCCATATC CCTCCGGGGA TTCATGTTGT GCAGAACCAC 
CGATACAAGG GTC6GTATAG GGAGGCCCCT AAGTACAACA C6TCTTGGTG 

5051 CAGCACAGTG TATCCGGTGC ACTTGGGAAA TTTGTCATGT AGCTTAGAAG 
GTCGTGTCAC ATAGGCCACG TGAACCCTTT AAACAGTACA TCGAATCTTC 

5101 GAAATGCGTG GAAGAACTTG GAGACGCCCT TGTGACCTCC AAGATTTTCC 
CTTTACGCAC CTTCTTGAAC CTCTGCGGGA ACACTGGAGG TTCTAAAAGG 

5151 ATGCATTCGT CCATAATGAT GGCAATGGGC CCACGGGCGG CGGCCTGGGC 
TACGTAAGCA GGTATTACTA CCGTTACCCG GGTGCCCGCC GCCGGACCCG 

5201 GAAGATATTT . CTGGGATCAC TAACGTCATA GTTGTGTTCC AGGATGAGAT 
CrrCTATAAA GACCCTAGTG ATTGCAGTAT CAACACAAGG TCCTACTCTA 

5251 CGTCATAGGC CATTTTTACA AAGCGCGGGC GGAGGGTGCC AGACTGCGGT 
GCAGTATCCG GTAAAAATGT TTCGCGCCCG CCTCCCACGG TCTGACGCCA 

5301 ATAATGGTTC CATCCGGCCC AGGGGCGTAG TTACCCTCAC AGATTTGCAT 
TATTACCAAG GTAGGCCGGG TCCCCGCATC AATGGGAGTG TCTAAACGTA 

5351 TTCCCACGCT TTGAGTTCAG ATGGGGGGAT CATGTCTACC TGCGGGGCGA 
AAGGGTGCGA AACTCAAGTC TACCCCCCTA GTACAGATGG ACGCCCCGCT 

5401 TGAAGAAAAC GGTTTCCGGG GTAGGGGAGA TCAGCTGGGA AGAAAGCAGG 
ACTTCTTTTG CCAAAGGCCC CATCCCCTCT AGTCGACCCT TCTTTCGTCC 

5451 TTCCTGAGCA GCTGCGACTT ACCGCAGCCG GTGGGCCCGT AAATCACACC 
AAGGACTCGT CGACGCTGAA TGGCGTCGGC CACCCGGGCA TTTAGTGTGG 

5501 TATTACCGGC TGCAACTGGT AGTTAAGAGA GCTGCAGCTG CCGTCATCCC 
ATAATGGCCG ACGTTGACCA TCAATTCTCT CGACGTCGAC GGCAGTACGG 

5551 TGAGCAGGGG GGCCACTTCG TTAAGCAT6T CCCTGACTCG CATGTTTTCC 
ACTC6TCCCC CCGGTGAAGC AATTCGTACA 6GGACTGAGC GTACAAAAGG 

5601 CTGACCAAAT CCGCCAGAAG GCGCTCGCCG CCCAGCGATA GCAGTTCTT6 
GACTGGTTTA GGCGGTCTTC CGCGAGCGGC GGGTCGCTAT CGTCAAGAAC 
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5651 CAAGGAAGCA aIWtTTTCA ACGGTTTGAG ACCGTCCGCC GTAGGCWBC 
GTTCCTTCGT TTCAAAAA6T TGCCAAACTC TGGCAGGCGG CATCCGTACG 

5701 TTTTGAGCGT TTGACCAAGC AGTTCCAGGC GGTCCCACAG CTCGGTCACC 
AAAACTCGCA AACTGGTTCG TCAAGGTCCG CCAGGGTGTC GAGCCAGTGG 

5751 TGCTCTACGG CATCTCGATC CAGCATATCT CCTCGTTTC6 CGGGTTGGGG 
ACGAGATGCC GTAGAGCTAG GTCGTATAGA GGAGCAAAGC GCCCAACCCC 

5801 CG6CTTTCGC TGTACGGCA6 TAGTCGGTGC TCGTCCAGAC GGGCCAGGGT 
GCCGAAAGC6 ACATGCC6TC ATCAGCCAC6 AGCAGGTCTG CCC6GTCCCA 

5851 CATGTCTTTC CACGGGCGCA GGGTCCTCGT CAGCGTAGTC TGGGTCACGG 
GTACAGAAAG GTGCCCGCGT CCCAGGAGCA GTCGCATCAG ACCCAGTGCC 

5901 TGAAGGGGTG CGCTCCGGGC TGCGCGCTGG CCAGGGTGCG CTTGAGGCTG 
ACTTCCCCAC GCGAGGCCCG ACGCGCGACC GGTCCCACGC GAACTCCGAC 

5951 GTCCTGCTGG TGCTGAAGCG CTGCCGGTCT TCGCCCTGCG CGTCGGCCAG 
CAGGACGACC ACGACTTCGC GACGGCCAGA AGCGGGACGC GCAGCCGGTC 

6001 GTAGCATTTG ACCATGGTGT CATAGTCCAG CCCCTCCGCG GCGTGGCCCT 
CATCGTAAAC TGGTACCACA GTATCAGGTC GGGGAGGCGC CGCACCGGGA 

6051 TGGCGCGCA6 CTTGCCCTTG GAGGAGGCGC CGCACGAGGG GCAGTGCAGA 
ACCGCGCGTC GAACGGGAAC CTCCTCC6CG GCGTGCTCCC CGTCACGTCT 

6101 CTTTTGAGGG CGTAGAGCTT GGGCGCGAGA AATACCGATT CCGGGGAGTA 
GAAAACTCCC GCATCTCGAA CCCGCGCTCT TTATGGCTAA GGCCCCTCAT 

6151 GGCATCCGCG CCGCAGGCCC. CGCAGACGGT CTCGCATTCC ACGAGCCAGG 
CCGTAGGCGC GGCGTCCGGG GCGTCTGCCA GA6CGTAAGG TGCTCGGTCC 

6201 TGAGCTCTGG CCGTTC6GGG TCAAAAACCA GGTTTCCCCC ATGCTTTTTG 
ACTCGAGACC GGCAAGCCCC AGTTTTTGGT CCAAAGGGGG TACGAAAAAC 

6251 ATGCGTTTCT TACCTCTGGT TTCCATGAGC CGGTGTCCAC GCTCGGTGAC 
TACGCAAAGA ATGGAGACGA AAGGTACTCG GCCACAGGTG CGAGCCACTG 

6301 GAAAAGGCTG TCCGTGTCCC CGTATACAQA CTTGAGAGGC CTGTCCTCGA 
CTTTTCCGAC AGGCACAGGG GCATAT6TCT GAACTCTCCG GACAGGAGCT 

6351 GCGGTGTTCC GCGGTCCTCC TCGTATAGAA ACTCGGACCA CTCTGAGACA 
CGCCACAAGG CGCCAGGAGG AGCATATCTT TGAGCCTGGT GAGACTCTGT 

6401 AA6GCTCGCG TCCAGGCCAG CACGAAGGAG GCTAAGTGGG AGGGGTAGCG 
TTCCGAGCGC A66TCCGGTC GTGCTTCCTC CGATTCACCC TCCCCATCGC 

6451 GTCGTTGTCC ACTAGGGGGT CCACTCGCTC CAGGGTGTGA AGACACATGT 
CA6CAACAGG TGATCCCCCA GGTGAGCGAG GTCCCACACT TCTGTGTACA 

6501 CGCCCTCTTC GGCATCAAGG AAGGTGATTG GTTTGTAGGT GTAGGCCACG 
GCGGGAGAAG CCGTAGTTCC TTCCACTAAC CAAACATCCA CATCCGGTGC 

6551 TGACCGGGTG TTCCTGAAGG GGGGCTATAA AAGGGGGTGG GGGCGCGTTC 
ACTGGCCCAC AAGGACTTCC CCCCGATATT TTCCCCCACC CCCGCGCAAG 
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6601 GTCCTCACTC TCJTCCGCAT CGCTGTCTGC GAGGGCCJ8SaJr®'ET Q3Egr G 
CAGGAGTGAG ;^iGGCGTA GCGACAGACG CTCCCGGTCG ACAACC^C 

6651 AGTACTCCCT CTGAAAAGG6 GGCATGACTT CTGCGCTAAG ATTGTCAGTT 
TCATGAGGGA GACTTTTCGC CCGTACTGAA GACGCGATTC TAACAGTCAA 

6701 TCCT^IAAACG AGGAGGATTT GATATTCACC TGGCCCGCGG TGATGCCTTT 
AGGTTTTTGC TCCTCCTAAA CTATAAGTGG ACCGGGCGCC ACTACGGAAA 

6751 GAGGGTGGCC GC ATCCATCT GGTCAGAAAA GACAATCTTT TTGTTGTCAA 
CTCCCACC6G CGTAGGTAGA CCACTCTTTT CTGTTA6AAA AACAACAGTT 

6801 GCTTGGTGGC AAACGACCCG TAGAGGGCGT TGGACA6CAA CTTGGCGATG 
CGAACCACCG TTTGCTGGGC ATCTCCCGCA ACCTGTCGTT GAACCGCTAC 

6851 GAGCGCAGGG TTTGGTTTTT GTCGCGATCG 6CGCGCTCCT TGGCCGCGAT 
CTCGCGTCCC AAACCAAAAA CAGCGCTAGC CGCGCCAGGA ACCGGCGCTA 

6901 GTTTAGCTGC ACGTATTCGC GCGCAACGCA CCGCCATTCG GGAAAGACGG 
CAAATCGACG TGCATAAGCG CGCGTTGCGT GGCGGTAAGC CCTTTCTGCC 

6951 TGGTGCGCTC GTCGGGCACC AGGTGCACGC GCCAACCGCG GTTGTGCAGG 
ACCACGCGAG CAGCCCGTGG TCCACGTGCG CGGTTGGCGC CAACACGTCC 

7001 GTGACAAGGT CAACGCTGGT GGCTACCTCT CCGCGTAGGC GCTCGTTGGT 
CACTGTTCCA GTTGCGACCA CCGATGGA6A GGCGCATCCG CGAGCAACCA 

7051 CCAGCAGAGG CGGCCGCCCT TGC6CGAGCA GAATGGCGGT AGGGGGTCTA 
GGTCGTCTCC GCCGGCGGGA ACGCGCTCGT CTTACCGCCA TCCCCCAGAT 

7101 GCTGCGTCTC GTCCGGGGGG TCTGCGTCCA CGGTAAAGAC CCCGGGCAGC 
CGACGCAGAG CAGGCCCCCC AGACGCAGGT GCCATTTCTG GGGCCCGTCG 

7151 AGGCGCGCGT CGAAGTAGTC TATCTTGCAT CCTTGCAA6T CTAGCGCCTG 
TCCGCGCGCA GCTTCATCAG ATAGAACGTA GGAACGTTCA GATCGCGGAC 

7201 CTGCCATGCG CGGGCGGCAA GCGCGCGCTC GTATGGGTTG AGTGGGGGAC 
GACGGTACGC GCCCGCCGTT CGCGCGCGAG CATACCCAAC TCACCCCCTG 

7251 CCCATGGCAT GGCCTGGGTG AGCGCGGAGG CGTACATGCC GCAAATGTCG 
GGGTACCGTA CCCCACCCAC TCGCGCCTCC GCATGTACGG CGTTTACAGC 

7301 TAAACGTAGA GGGGCTCTCT GAGTATTCCA AGATATGTAG GGTAGCATCT 
ATTTGCATCT C3CCCGAGAGA CTCATAA66T TCTATACATC CCATCGTAGA 

7351 TCCACCGCGG ATGCTGGCGC GCACGTAATC GTATAGTTCG TGCGAGGGAG 
AGGTGGCGCC TACGACCGCG CGTGCATTAG CATATCAAGC ACGCTCCCTC 

7401 C6AGGAGGTC GGGACCGAGG TTGCTACGGG CGGGCTGCTC TGCTCGGAAG 
GCTCCTCCAG CCCTGGCTCC AACGATGCCC GCCCGACGAG ACGAGCCTTC 

7451 ACTATCTGCC TGAAGATGGC ATGTGAGTTG GATGATATG6 TTGGACGCTG 
TGATAGACGG ACTTCTACCG TACACTCAAC CTACTATACC AACCTGCGAC 

7501 GAAGACGTTG AAGCTGGCGT CT6TGAGACC TACCGCGTCA CGCACGAAGG 
CTTCTGCAAC TTCGACCGCA GACACTCTGG ATGGCGCAGT GCGTGCTTCC 
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7551 AG6CGTAGGA G^pCGCAGC TTGTTGAC'CA GCTC66CGGT GACCTGapG 
TCCGCATCCT CAGCGCGTCG AACAACTGGT CGAGCCGCCA CTGGACGTCC 

7601 TCTAGGGC6C AGTAGTCCAG GGTTTCCTTG ATGATGTCAT ACTTATCCTG 
AGATCCCGCG TCATCAGGTC CCAAAGGAAC TACTACAGTA TGAATAGGAC 

7651 TCCCTTTTTT TTCCACAGCT CGCGGTTGAG GACAAACTCT TCGCGGTCTT 
AGGGAAAAAA AAGGTGTCGA 6CGCCAACTC CTGTTTGAGA AGCGCCAGAA 

7701 TCCAGTACTC TTGGATCGGA AACCCGTCGG CCTCCGAACG GTAAGAGCCT 
AGGTCATGAG AACCTAGCCT TTGGGCAGCC GGAGGCTTGC CATTCTCGGA 

7751 AGCATGTAGA ACTGGTTGAC GGCCTGGTAG GCGCAGCATC CCTTTTCTAC 
TCGTACATCT TGACCAACTG CCGGACCATC CGCGTCGTAG GGAAAAGATG 

7801 GGGTAGC6CG TATGCCTGCG CGGCCTTCCG GAGCGAGGTG TGGGTGACCG 
CCCATCGCGC ATACGGACGC GCCGGAA6GC CTCGCTCCAC ACCCACTCGC 

7851 CAAAGGTGTC CCTGACCATG ACTTTGAGGT ACTGGTATTT GAAGTCAGTG 
GTTTCCACAG GGACTGGTAC TGAAACTCCA TGACCATAAA CTTCAGTCAC 

7901 TCGTCGCATC CGCCCTGCTC CCAGAGCAAA AAGTCCGTGC GCTTTTTGGA 
AGCAGCGTAG GCGGGACGAG GGTCTCGTTT TTCAGGCACG CGAAAAACCT 

7951 ACGCGGATTT GGCAGGGCGA AGGTGACATC GTTGAAGAGT ATCTTTCCCG 
TGCGCCTAAA CCGTCCCGCT TCCACTGTAG CAACTTCTCA TAGAAAGGGC 

8001 CGCGAGGCAT AAAGTTGCGT GT6ATGCGGA AGGGTCCCGG CACCTCGGAA 
GCGCTCCGTA TTTCAAC6CA CACTACGCCT TCCCAGGGCC GTGGAGCCTT 

8051 CGGTTGTTAA TTACCTGGGC GGCGAGCACG ATCTCGTCAA AGCCGTTGAT 
GCCAACAATT AATGGACCCG CCGCTCGTGC TAGAGCAGTT TCGGCAACTA 

8101 GTTGTGGCCC . ACAATGTAAA GTTCCAAGAA GCGCGGGATG CCCTTGATGG 
CAACACCGGG TGTTACATTT CAAGGTTCTT CGCGCCCTAC GGGAACTACC 

8151 AAGGCAATTT TTTAAGTTCC TCGTAGGTGA GCTCTTCAGG GGAGCTGAGC 
TTCCGTTAAA AAATTCAAGG AGCATCCACT CGAGAAGTCC CCTCGACTCG 

8201 CCGTGCTCTG AAAGGGCCCA GTCTQCAAQA TGAGGGTTGG AAGCGACGAA 
GGCACGAGAC TTTCCCGGGT CAGACGTTCT ACTCCCAACC TTCGCTGCTT 

8251 TGAGCTCCAC AGGTCACGGG CCATTAGCAT TTGCAGGTGG TCGCGAAAGG 
ACTCGAGGTG TCCAGTGCCC GGTAATCGTA AACGTCCACC AGCGCTTTCC 

8301 TCCTAAACT6 GCGACCTATG GCCATTTTTT CTGGGGTGAT GCAGTAGAAG 
AGGATTTGAC CGCTGGATAC CG6TAAAAAA GACCCCACTA CGTCATCTTC 

8351 GTAAGCGGGT CTTGTTCCCA GCGGTCCCAT CCAAGGTTCG CGGCTAGGTC 
CATTCGCCCA GAACAAGGGT CGCCAGGGTA GGTTCCAAGC GCCGATCCAG 

8401 TCGCGCGGCA GTCACTAGAG GCTCATCTCC GCC6AACTTC ATGACCAGCA 
AGC6CGCC6T CA6TGATCTC CGAGTA6AGG CGGCTTGAAG TACTGGTCGT 

8451 TGAAGGGCAC GAGCTGCTTC CCAAAGGCCC CCATCCAAGT ATAGGTCTCT 
ACTTCCCGTG CTCGAC6AAG GGTTTCCGGG GGTAGGTTCA TATCCAGAGA 
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■ 8501 ACATCGTAGG iMkAAAGAG ACGCTCGGTG CGAGGATGCG AGCCGA^G 
TGTAGCATCC ACTGTTTCTC TGCGAGCCAC GCTCCTACGC TCGGCtSBcC 

8551 GAAGAACTGG ATCTCCCGCC ACCAATTGGA GGAGTGGCTA TTGATGTGGT 
CTTCTTGACC TAGAGGGCGG TGGTTAACCT CCTCACCGAT AACTACACCA 

8601 GAAAGTAGAA GTCCCTGCGA CGGGCCGAAC ACTCGTGCTG GCTTTTGTAA 
CTTTCATCTT CA6GGACGCT GCCCGGCTTG TGACCACGAC CGAAAACATT 

8651 AAACGTGCGC AGTACTGGCA GCGGTGCACG GGCTGTACAT CCTGCACGAG 
TTTGCACGCG TCATGACCGT CGCCACGTGC CCGACATGTA GGACGTGCTC 

8701 GTTGACCTGA CGACCGCGCA CAAGGAAGCA GAGTGGGAAT TTGAGCCCCT 
CAACTGGACT GCTGGCGCGT GTTCCTTCGT CTCACCCTTA AACTCGGGGA 

8751 CGCCTGGCGG GTTTGGCTGG TGGTCTTCTA CTTCGGCTGC TTGTCCTTGA 
GCGGACCGCC CAAACCGACC ACCAGAAGAT GAAGCCGACG AACAGGAACT 

8801 CCGTCTGGCT GCTCGA66GG AGTTACGGTG GATC6GACCA CCACGCCGCG 
GGCAGACCGA CGAGCTCCCC TCAATGCCAC CTAGCCTGGT GGTGCGGCGC 

8851 CGAGCCCAAA CTCCAGATGT CCGCGCGCGG CGGTCGGAGC TTGATGACAA 
GCTCGGGTTT CAGGTCTACA GGCGCGCGCC GCCAGCCTCG AACTACTGTT 

8901 CATCGCGCAG ATGGGAGCTG TCCATGGTCT GGAGCTCCCG CGGCGTCAGG 
GTAGCGCGTC TACCCTCGAC AGGTACCAGA CCTCGAGGGC GCCGCAGTCC 

8951 TCAGGCGGGA 6CTCCTGCAG GTTTACCTCG CATAGACGGG TCAGGGCGCG 
AGTCCGCCCT CGAGGACGTC CAAAT6GAGC GTATCTGCCC AGTCCCGCGC 

9001 GGCTAGATCC AGGTGATACC TAATTTCCAG GGGCTGGTTG GTGGCGGCGT 
CCGATCTAGG TCCACTATGG ATTAAAGGTC CCCGACCAAC CACCGCCGCA 

9051 CGATGGCTTG CAAGAGGCCG CATCCCCGCG GCGCGACTAC GGTACCGCGC 
GCTACCGAAC GTTCTCCGGC GTAGGGGCGC CGCGCTGATG CCATGGCGCG 

9101 GGCGGGCGGT GGGCCGCGGG GGTGTCCTTG GATGATGCAT CTAAAAGCGG 
CCGCCCGCCA CCCGGCGCCC CCACAGGAAC CTACTACGTA GATTTTCGCC 

9151 TGACGCGGGC GAGCCCCCGG AGGTA6GGGG GGCTCCGGAC CCGCCGGGAG 
ACTCCGCCCG CTCGGGGGCC TCCATCCCCC CCGAGGCCTG GGCGGCCCTC 

9201 AGGGGGCAGG GGCACGTCGG CGCCGCGCGC GGGCAGGAGC TGGTGCTGCG 
TCCCCCGTCC CCGTGCAGCC GCGGCGCGCG CCCGTCCTCG ACCACGACGC 

9251 CGCGTAGGTT GCTGGCGAAC GCGACGACGC GGCGOTTGAT CTCCTGAATC 
GCGCATCCAA CGACCGCTTG CGCTGCTGCG CCGCCAACTA GAGGACTTAG 

9301 TGGCGCCTCT GCGTGAAGAC GACGGGCCCG GTGAGCTTGA ACCTGAAAGA 
ACC6CGGAGA CGCACTTCTG CTGCCCGGGC CACTCGAACT TGGACTTTCT 

9351 GAGTTCGACA GAATCAATTT CGGTGTCGTT 6ACGGCGGCC TGGCGCAAAA 
CTCAA6CTGT CTTAGTTAAA GCCACAGCAA CTGCCGCC6G ACCGCGTTTT 

9401 TCTCCTGCAC GTCTCCTGAC TTGTCTTGAT AGGCGATCTC GGCCATGAAC 
AGAGGACGTG CAGAGGACTC AACAGAACTA TCCGCTAGAG CCGGTACTTG 
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9451 TGCTCGATCT C^BCTCCTG QAGATCTCCG CGTCCGGCTC GCTCCA^pT 
ACGAGCTAGA gSJgGAGGAC CTCTAGAGGC GCAGGCCGAG CGAGGTGCCA 

9501 GGCGGCGAGG TCGTTGGAAA TGCGGGCCAT GAGCTGCGAG AAGGCGTTGA 
CCGCCGCTCC AGCAACCTTT ACGCCCGGTA ctcgacgctc ttccgcaact 

9551 GGCCTCCCTC GTTCCA6ACG CGGCTGTAGA CCACGCCCCC TTCGGCATCG 
CCGGAGGGAG CAAGGTCTGC GCCGACATCT GGTGCGGGGG AAGCCGTAGC 

9601 CGGGCGCGCA TGACCACCTG CGCGAGATTG AGCTCCACGT GCCGGGCGAA 
GCCCGCGCGT ACTGGTGGAC GCGCTCTAAC TCGAGGTGCA CGGCCCGCTT 

9651 GACGGCGTAG TTTCGCAGGC GCTGAAAGAG GTAGTTGAGG GTGGTGGCGG 

CTGCCGCATC AAAGCGTCCG cgactttctc catcaactcc caccaccgcc 

9701 tgtgttctgc cacgaagaag tacataaccc agcgtcgcaa cgtggattcg 
ACACAAGACG GTGCTTCTTC ATGTATTGGG TCGCAGCGTr gcacctaagc 

9751 ttgatatccc ccaaggcctc aa6gcgctcc atggcctcgt agaagtccac 
aactataggg ggttccggag ttccgcgagg taccggagca tcttcaggtg 

9801 ggcgaagttg aaaaactggg agttgcgcgc cgacacggtt aactcctcct 

CCGCTTCAAC TTTTTGACCC TCAACGCGC6 GCTGTGCCAA TTGAGGAGGA 

9851 CCAGAAGACG GATGAGCTCG GCGACAGTGT CGCGCACCTC GCGCTCAAAG 
GGTCTTCTGC CTACTCGAGC CGCTGTCACA GCGCGTGGAG cgcgagtttc 

9901 gctacagggg cctcttcttc ttcttcaatc tcctcttcca taagggcctc 
cgatgtcccc ggagaagaag aagaagttag aggagaaggt attcccggag 

9951 CCCTTCTTCT TCTTCTGGCG GCGGTGGGGG AGGGGGGACA CGGCGGCGAC 
GGGAAGAAGA AGAAGACCGC CGCCACCCCC TCCCCCCTGT GCCGCCGCTG 

10001 6ACGGCGCAC CGGGAGGCGG TCGACAAAGC GCTCGATCAT CTCCCCGCGG 
CTGCCGCGTG GCCCTCCGCC AGCTGTTTCG CGAGCTAGTA GAGGGGCGCC 

10051 CGACGGCGCA TGGTCTCGGT GACGGCGCGG CCGTTCTCGC GGGGGCGCAG 
GCTGCCGCGT ACCAGAGCCA CTGCCGCGCC GGCAAGAGCG CCCCCGCGTC 

10101 TTG<3AAGACG CCGCCCGTCA TGTCCCGGTT ATGGGTTGGC GGGGGGCTGC 
AACCTTCTGC GGCGGGCAGT ACAGGGCCAA TACCCAACCG CCCCCCGACG 

10151 CATGCGGCAG GGATACGGCG CTAACGATGC ATCTCAACAA TTGTTGTGTA 
GTACGCCGTC CCTATGCCGC GATTGCTACG TAGAGTTGTT AACAACACAT 

10201 GGTACTCC6C C6CCGAGGGA CCTGAGCGAG TCCGCATCGA CCGGATCGGA 
CCATGAGGCG GCCGCTCCCT GGACTCGCTC AGGCGTAGCT GGCCTAGCCT 

10251 AAACCTCTCG AGAAAGGCGT CTAACCAGTC ACAGTCGCAA GGTAGGCTGA 
TTT6GAGAGC TCTTTCCGCA GATTGGTCAG TGTCAGCGTT CCATCCGACT 

10301 GCACCGTGGC GGGC6GCAGC GGGCGGCGGT CGGGGTTGTT TCTGGCGGAG 
CGTGGCACCG CCCGCC6TCG CCCGCC6CCA GCCCCAACAA A6ACCGCCTC 

10351 GTGCTGCTGA TGATCTAATT AAAGTAGGCG GTCTTGAGAC GGCGGATGGT 
CACGACGACT ACTACATTAA TTTCATCCGC CAGAACTCTG CCGCCTACCA 
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10401 CGACAGAAGC A||^TGTCCT TGGGTCCGGC CTGCTGAA^G'^Cd<::AGSft|T 
CCTGTCTTCG otRaCAGGA ACCCAGGCCG GACGACTTAC GCGTCcWIa 

10451 CGGCCATGCC CCAGGCTTCG TTTTGACATC GGCGCAGGTC tttgtagtag 

gccggtacgg gistccgaagc aaaactgtag ccgcgtccag aaacatcatc 

10501 tcttgcatga gcctttctac cggcacttct tcttctcctt cctcttgtcc 
agaacgtact cggaaagatg gccgtgaaga agaagaggaa ggagaacagg 

10551 TGCATCTCTT GCATCTATCG CTGCGGCGGC GGCGGAGTTT GGCCGTAGGT 

acgtagagaa cgtagatagc gacgccgccg ccgcctcaaa ccggcatcca 
10601 ggcgccctct tcctcccatg cgtgtgaccc cgaagcccct catcggctga 

CC6C6GGAGA AGGA6G6TAC GCACACTG6G GCTTCGGGGA GTAGCCGACT 

10651 A6CAGGGCTA GGTCGGCGAC AACGCGCTCG GCTAATATGG CCTGCTGCAC 
TCGTCCCGAT CCAGCCGCTG TTGCGCGAGC CGATTATACC GGACGACGTG 

10701 CTGCGTGAGG GTAGACTGGA AGTCATCCAT GTCCACAAAG CGGTGGTATG 
GACGCACTCC CATCTGACCT TCA6TAGGTA CAGGTGTTTC GCCACCATAC 

10751 CGCCCGTGTT GATGGTGTAA GTGCAGTTGG CCATAACGGA CCAGTTAACG 
GCGGGCACAA CTACCACATT CACGTCAACC GGTATTGCCT 6GTCAATTGC 

10801 GTCTGGTGAC CCGGCTGCGA GAGCTCGGTG TACCTGAGAC GCGAGTAAGC 
CAGACCACTG GGCCGACGCT CTCGAGCCAC ATGGACTCTG CGCTCATTCG 

10851 CCTCGAGTCA AATACGTAGT CGTTGCAAGT CCGCACCAGG TACTGGTATC 
GGAGCTCAGT TTATGCATCA GCAACGTTCA GGCGTGGTCC ATGACCATAG 

10901 CCACCAAAAA GTGCGGCGGC GGCTGGCGGT AGAGGGGCCA GCGTAGGGTG 
GGTGGTTTTT GACGCCGCCG CCGACCGCCA TCTCCCCGGT CGCATCCCAC 

10951 GCCGGGGCTC CGGGGGCGAG ATCTTCCAAC ATAAGGCGAT GATATCCGTA 
CGGCCCCGAG GCCCCCGCTC TAGAAGGTTG TATTCCGCTA CTATAGGCAT 

11001 GATGTACCTG GACATCCAGG TGATGCCGGC GGCGGTGGTG GAGGCGCGCG 
CTACATGGAC CTGTAGGTCC ACTACGGCCG CCGCCACCAC CTCCGCGCGC 

11051 GAAAGTCGCG GACGCGGTTC CAGATGTTGC GCAGCGGCAA AAAGTGCTCC 
CTTTCAGCGC CTGCGCCAAG GTCTACAACG CGTCGCCGTT TTTCACGAGG 

11101 ATGGTCGGGA CGCT^CTGGCC GGTCAGOCGC 6CGCAATCGT TGACGCTCTA 
TACCAGCCCT GCGAGACCGG CCAGTCC6CG CGCGTTAGCA ACTGCGAGAT 

11151 GACCGTGCAA AAGGAGAGCC TGTAAGC6G6 CACTCTTCCG TGGTCTGGTG 
CTGCCACGTT TTCCTCTC6G ACATTCGCCC 6TGAGAAGGC ACCAGACCAC 

11201 CATAAATTCG CAAGGGTATC ATGGCGGACG ACCGGGGTTC GAGCCCCGTA 
CTATTTAAGC GTTCCCATAG TACCGCCTGC TGGCCCCAAG CTCGGGGCAT 

11251 TCCGGCCGTC CGCCCTGATC CATGCGGTTA CCGCCCGCGT GTCGAACCCA 
AGGCCGGCAG GCGGCACTAG GTACGCCAAT GGCGGGCGCA CAGCTTGG6T 

11301 GGTGTGCGAC GTCAGACAAC G6GG6AGTGC TCCTTTTGGC TTCCTTCCAG 
CCACACGCTG CAGTCTGTTG CCCCCTCACG AGGAAAACCG AAGGAAGGTC 
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11351 GCGCCGCGGC T^pGCGCTA 6CTTTTTTGG CCACTGGCCG CGCGC^^BT 
. CGCGCCGCCG aRaCGCGAT CGAAAAAACC GGTGACCGGC GCGCGT^Ra 

11401 AAGCGGTTAG GCTGGAAAGC GAAAGCATTA AGTGGCTCGC TCCCTGTAGC 
TTCGCCAATC CGACCTTTCG CTTTCGTAAT TCACCGA6CG AGGGACATCG 

11451 CGGAGGGTTA TTTTCCAAGG GTT6AGTCGC GGGACCCCCG GTTCGAGTCT 
GCCTCCCAAT AAAAGGTTCC CAACTCA6CG CCCTGG6GGC CAAGCTCAGA 

11501 CGGACCGGCC GGACTGCGGC GAACGGGGGT TTGCCTCCCC GTCATGCAAG 
GCCTGGCCGG CCTGACGCCG CTTGCCCCCA AACGGAGGGG CAGTACGTTC 

11551 ACCCCGCTTG CAAATTCCTC CGGAAACAGG GACGAGCCCC TTTTTTGCTT 
TGGGGCGAAC GTTTAAGGA6 GCCTTTGTCC CTGCTCGGGG AAAAAACGAA 

11601 TTCCCAGATG CATCCGGTGC TGCGGCAGAT GCGCCCCCCT CCTCAGCAGC 
AAGGGTCTAC CTAGGCCACG ACGCCGTCTA CGCGGGGGGA GGAGTCGTCG 

11651 GGCAAGAGCA AGAGCAGCG6 CAGACATGCA GGGCACCCTC CCCTCCTCCT 
CCGTTCTCGT TCTCGTCGCC GTCTGTACGT CCCGTGGGAG 6GGAGGAGGA 

11701 ACCGCGTCAG GAGGGGCGAC ATCCGCGGTT GACGCGGCAG CAGATG6TGA 
TGGCGCAGTC CTCCCCGCTG TAGGCGCCAA CTGCGCCGTC GTCTACCACT 

11751 TTACGAACCC CCGCGGCGCC GGGCCCGGCA CTACCTGGAC TTGGAGGA6G 
AATGCTTGGG GGCGCCGCGG CCCGGGCCGT GATGGACCTG AACCTCCTCC 

11801 GCGAGGGCCT GGCGCGGCTA GGAGCGCCCT CTCCTGAGCG GCACCCAAGG 
CGCTCCCGGA CCGCGCCGAT CCTCGCGGGA GAGGACTCGC CGTGGCTTCC 

11851 GTGCAGCTGA AGCGTGATAC GCGTGAGGCG TACGTGCCGC GGCAGAACCT 
CACGTCGACT TCGCACTATG CGCACTCCGC atgcacggcg ccgtcttgga 

11901 GTTTCGCGAC CGCGAGGGAG AGGAGCCCGA GGAGATGCGG GATCGAAAGT 
CAAAGCGCTG GCGCTCCCTC TCCTCGGGCT cctctacgcc ctagctttca 

11951 tccacgcagg gcgcgagctg cggcatggcc tgaatcgcga gcggttgctg 
aggtgcgtcc cgcgctcgac gccgtaccgg acttagcgct cgccaacgac 

12001 cgcgaggagg actttgagcc cgacgcgcga accgggatta gtcccgcgcg 
gcgctcctcc tgaaactcgg gctgcgcgct tggccctaat cagggcgcgc 

12051 cgcacacgtg gcggccgccg acctggtaac cgcatacgag ca6acggtga 
gcgtgtgcac cgccggcggc tggaccattg gcgtatqctc gtctgccact 

12101 acca66agat taactttcaa aaaagcttta acaaccacgt gcgtacgctt 
tggtcctcta attgaaagtt ttttcgaaat tgttg6tgca cgcatgcgaa 

12151 gtggcgcgcg aggaggtggc tataggactc atgcatctgt gggactttgt 
ct^ccgcgcgc tcctccaccg atatcctgac tacgtagaca ccctgaaaca 

12201 aagcgcgctg gagcaaaacc caaatagcaa gccgctcatg gcgcagctgt 
ttcgcgcgac ctcgttttgg gtttatcgtt • cggcga6tac cgcgtcgaca 

12251 tccttatagt gcagcacagc agggacaacg aggcattcag ggatgcgctg 
aggaatatca cgtcgtgtcg tccctcttgc tccgtaagtc cctacgcgac 
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12301 CTAAACATAG T^^GCCCGA GGGCCGCTGG CTGCTCGATT TGATAAMfT 
GATTTGTATC AkScGGGCT CCCGGCGACC GACGAGCTAA ACTATtWPa 

12351 CCTGCAGAGC ATAGTGGTGC AGGAGCGCAG CTT6AGCCTG GCTGACAAGG 
GGACGTCTCG TATCACCACG TCCTCGCGTC 6AACTCGGAC CGACTGTTCC 

12401 TGGCCGCCAT CAACTATTCC ATGCTTAQCC TGGGCAAGTT TTACGCCCGC 
ACCGGCGGTA 6TTGATAAGG TACGAATCGG ACCCGTTCAA AATGCGGGCG 

12451 AAGATATACC ATACCCCTTA CGTTCCCATA GACAAGGAGG TAAAGATCGA 
TTCTATATGG TATGGGGAAT GCAAGGGTAT CTGTTCCTCC ATTTCTAGCT 

12501 GG6GTTCTAC ATGCGCATG6 CGCTGAAGGT GCTTACCTTG AGCGACGACC 
CCCCAAGATG TACGCGTACC GCGACTTCCA C6AATGGAAC TCGCTGCTGG 

12551 T6GGCGTTTA TCGCAACGAG CGCATCCACA AGGCCGTGAG CGTGAGCCGG 
ACCCGCAAAT AGCGTTGCTC GCGTAGGTGT TCCGGCACTC GCACTCGGCC 

12601 CGGCGCGAGC TCAGCGACCG CGAGCTGATG CACAGCCTGC AAAGGGCCCT 
GCCGCGCTCG AGTCGCTGGC GCTCGACTAC GTGTCGGACG TTTCCCGGGA 

12651 GGCTGGCACG GGCAGCGGCG ATAGAGAGGC CGAGTCCTAC TTTGACGCGG 
CCGACCGTGC CCGTCGCCGC TATCTCTCCG GCTCA6GATG AAACTGCGCC 

12701 GCGCTGACCT GCGCTGGGCC CCAAGCCGAC GCGCCCTCGA GGCAGCTGGG 
CGCGACTGGA CGCGACCCGG GGTTCGGCTG CGCGGGACCT CCGTCGACCC 

12751 GCCGGACCTG 6GCT6GCGGT GGCACCCGCG CGCGCTGGCA ACGTCGGCGG 
CGGCCTGGAC CCGACCGCCA CCGTGGGCGC GCGCGACCGT TGCAGCCGCC 

12801 CGTGGAGGAA TATGACGAGG ACGATGAGTA CGAGCCAGAG GACGGCGAGT 
GCACCTCCTT ATACTGCTCC TGCTACTCAT GCTCGGTCTC CTGCCGCTCA 

12851 AGTAAGCGGT GATGTTTCTG ATCAGATGAT GCAAGACGCA ACGGACCCGG 
TGATTCGCCA CTACAAAGAC TAGTCTACTA CGTTCTGCGT TGCCTGGGCC 

12901 CGGTGCGGGC GGCGCTGCAG AGCCAGCCGT CCGGCCTTAA CTCCACGGAC 
GCCACGCCCG CC6CGACGTC TCGGTCGGCA GGCCGGAATT GAGGTGCCTG 

12951 GACTGGCGCC AGGTCATGGA CCCCATCATG TCGCTGACTG CGCGCAATCC 
- CTGACCGCGG TCCAGTACCT GGCGTAGTAC AGCGACTGAC 6CGCGTTAGG 

13001 TGACGCGTTC CGGCAGCAGC CGCAGGCCAA CCGGCTCTCC GCAATTCTGG 
ACTGCGCT^G GCCGTCGTCG GCGTCCGGTT GGCCGAGAGG CGTTAAGACC 

13051 AAGCGGTGGT CCCGGCGCGC GCAAACCCCA CGCACGAGAA GGTGCTGGCG 
TTCGCCACCA GGGCCGCGCG CGTTTGGGGT GCGTGCTCrT CCACGACCGC 

13101 ATCGTAAACG CGCTGGCCGA AAACAGGGCC ATCCGGCCCG ACGAGGCCGG 
TAGCATTTGC GCGACCGGCT TTTGTCCCGG TAGGCCGGGC TGCTCCGGCC 

13151 CCTGGTCTAC GACGCGCTGC TTCAGCGCGT GGCTCGTTAC AACA6CGGCA 
GGACCAGATG CT6CGCGACG AAGTCGCGCA CCGAGCAATG TTGTCGCCGT 

13201 ACGTGCAGAC CAACCTG6AC CGGCTGGTGG GGGATGT6CG CGAGGCCGT6 
TGCACGTCTG 6TTGGACCTG GCCGACCACC CCCTACACGC GCTCCGGCAC 
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13251 



GCGCAGCGTG 



ICGCGCA GCAGCAGGGC AACCTGGGCT CCA' 




CGCGTCGCAC TcBcGCGCGT CGTCGTCCCG TTGGACCCGA GGTACCMCG 

13301 ACTAAACGCC TTCCTGAGTA CACAGCCCGC CAACGTGCCG CGGGGACA6G 
TGATTTGCGG AAGGACTCAT GTGTCGGGCG GTTGCACGGC GCCCCTGTCC 

13351 AGGACTACAC CAACTTTGTG AGCGCACTGC GGCTAATGGT GACTGAGACA 
TCCTGATGTG GTTGAAACAC TCGCGTGACG CCGATTACCA CTGACTCTGT 

13401 CCGCAAA6TG AGGTGTACCA GTCTGGGCCA GACTATTTTT TCCAGACCAG 
GGCGTTTCAC TCCACATGGT CAGACCCGGT CTGATAAAAA AGGTCTGGTC 

13451 TAGACAAGGC CTGCAGACCG TAAACCTGAG CCAGGCTTTC AAAAACTTGC 
ATCTGTTCCG GACGTCTGGC ATTTGGACTC GGTCCGAAA6 TTTTT6AACG 

13501 AG6GGCTGTG GGGGGTGCGG GCTCCCACAG GCGACCGCGC GACCGTGTCT 
TCCCCGACAC CXICCCACGCC CGAGGGTGTC CCCTGGCGCG CTGGCACAGA 

13551 AGCTTGCTGA CGCCCAACTC GC6CCTGTTG CTGCTGCTAA TAGCGCCCTT 
TCGAACGACT GC66GTTGAG CGCGGACAAC GACGACGATT ATCGCGGGAA 

13601 CACGGACAGT GGCAGCGTGT CCCGGGACAC ATACCTAGGT CACTT6CTGA 
GTGCCTGTCA CCGTCGCACA GGGCCCTGTG TATGGATCCA GTGAACGACT 

13651 CACTGTACCG CGAGGCCATA GGTCAGGCGC ATGTGGACGA GCATACTTTC 
GTGACATGGC GCTCCGGTAT CCAGTCCGCG TACACCTGCT CGTATGAAAG 

13701 CAGGAGATTA CAAGTGTCAG CCGCGCGCTG GGGCAGGA6G ACACGGGCAG 
GTCCTCTAAT GTTCACAGTC GGCGCGCGAC CCCGTCCTCC TGTGCCCGTC 

13751 CCTGGAGGCA ACCCTAAACT ACCTGCTGAC CAACCGGCGG CAGAAGATCC 
GGACCTCCGT TGGGATTTGA TGGACGACTG GTTGGCCGCC GTCTTCTAGG 

13801 CCTCGTTGCA CAGTTTAAAC AGCGAGGAGG AGCGCATTTT 6CGCTACGTG 
GGAGCAACGT GTCAAATTTG TCGCTCCTCC TCGCGTAAAA CGCGATGCAC 

13851 CAGCAGAGCG TGAGCCTTAA CCTGATGCGC GACGGGGTAA cgcccagcgt 

gtcgtctcgc actcggaatt ggactacgcg ctgccccatt gcgggtcgca 

13901 ggcgctggac atgaccgcgc gcaacatgga accgggcatg tatgcctcaa 
ccgcgacctg tactggcgcg cgttgtacct tggcccgtac atacggagtt 

13951 accggccgtt tatcaaccgc ctaatggact acttgcatcg cgcggccgcc 
tggccggcaa atagttggcg gattacctga tgaacgtagc gcgccggcgg 

14001 6tgaaccccg agtatttcac caatgccatc ttgaacccgc actccctacc 

CACTTGGGGC TCATAAAGTG G1TACGGTA6 AACTTGGGCG TGACCGATGG 

14051 GCCCCCTGGT TTCTACACCG GGGGATTCGA GGTGCCCGAG GGTAACGATG 
CGGGGGACCA AAGATGTGGC CCCCTAAGCT CCACGGGCTC CCATTGCTAC 

14101 GATTCCTCTG GGACGACATA GACGACAGCG TGTTTTCCCC GCAACCGCAG 
CTAAGGAGAC CCTGCTGTAT CTGCTGTCGC ACAAAAGGGG CGTTGGCGTC 

14151 ACCCTGCTAG AGTTGCAACA GCGCGAGCAG GCAQAGGCGG CGCTGCGAAA 
TGGGACGATC TCAACGTTGT CGCGCTCGTC CGTCTCCGCC GCGACGCTTT 
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14201 GGAAAGCTTC Mj^GGCCAA GCAGCTTGTC CGATCTAGfeC:^C?rGcgj|tC 
CCTTTCGAAG oK^CGGTT CGTCGAACAG 6CTAGATCCG CGACGC^Pg 

14251 CGCGGTCAGA TGCTAGTAGC CCATTTCCAA GCTTGATAGG GTCTCTTACC 
GCGCCAGTCT ACGATCATCG GGTAAAGGTT CGAACTATCC CAGAGAATGG 

14301 AGCACTCGCA CCACCCGCCC GCGCCTGCTG GGCGAGGAGG agtacctaaa 

tcgtgagcgt ggtgggcggg cgcggacgac ccgctcctcc tcatggattt 

14351 caactcgctg ctgcagccgc agcgcgaaaa aaacctgcct ccggcatttc 
gttgagcgac gacgtcggcg tcgcgctttt tttggacgga ggccgtaaag 

14401 ccaacaacgg gatagagagc ctagtggaca agatgagtag atggaagacg 
ggttgttgcc ctatctctcg gatcacctgt tctactcatc taccttctgc 

14451 TACGCGCAGG AGCACAGGGA CGTGCCAGGC CCGCGCCCGC CCACCCGTCG 
ATGCGCGTCC TCGTGTCCCT GCACGGTCCG GGCGCGGGCG GGTGGGCAGC 

14501 tcaaaggcac gaccgtcagc ggggtct6gt gtgggaggac gatgactcgg 
agtttccgtg ctggcagtcg ccccagacca caccctcctg ctactgagcc 

14551 CAGACGACAG CA6CGTCCTG GATTTGGGAG GGAGTGGCAA CCCGTTTGCG 
GTCTGCTGTC GTCGCAGGAC CTAAACCCTC CCTCACCGTT gggcaaacgc 

14601 caccttcgcc ccaggctggg gagaatgttt taaaaaaaaa aaaagcatga 
GTGGAAGCGG ggtccgaccc ctcttacaaa attttttttt ttttcgtact 

14651 tgcaaaataa aaaactcacc aaggccatgg caccgagcgt tggttttctt 

ACGTTTTATT TTTTGAGTGG TTCCGGTACC GTGGCTCGCA ACCAAAAGAA 
14701 GTATTCCCCT TAGTATGCGG CGCGCGGCGA TGTATGAGGA AGGTCCTCCT 

cataagggga atcatacgcc gcgcgccgct acatactcct tccaggagga 

14751 ccctcctacg agagtgtggt gagcgcggcg ccagtggcgg cggcgctggg 
gggaggatgc tctcacacca ctcgcgccgc ggtcaccgcc gccgcgaccc 

14801 ttctcccttc gatgctcccc tggacccgcc gtttgtgcct ccgcggtacc 
aagagggaag ctacgagggg acctgggcgg caaacacgga 6gcgccatgg 

14851 TGCGGCCTAC CGGGGGGAGA AACAGCATCC GTTACTCTGA GTTGGCACCC 
ACGCCGGATG GCCCCCCTCT TTGTCGTAGG CAATGAGACT CAACCGTGGG 

14901 CTATTCGACA CCACCCGTGT GTACCTGGTG GACAACAAGT CAACGGATGT 
GATAAGCTGT GGTGGGCACA CATGGACCAX: CTGTTGTTCA GTTGCCTACA 

14951 GGCATCCCTG AACTACCAGA ACGACCACAG CAACTTTCTG ACCACGGTCA 
CCGTAGGGAC TT6ATGGTCT TGCTGGTGTC GTTGAAAGAC TGGTGCCAGT 

15001 TTCAAAACAA TGACTACAGC CCGGGGGAGG CAAGCACACA GACCATCAAT 
AAGTTTTGTT ACTGATGTCG GGCCCCCTCC GTTCGTGTGT CTGGTAGTTA 

15051 CTTGACGACC GGTCGCACTG G6GCGGCGAC CTGAAAACCA TCCTGCATAC 
GAACTGCTGG CCA6CGTGAC CCCGCCGCTG GACTTTTGGT AGGACGTATG 

15101 CAACATGCCA AATGTGAACG AGTTCATGTT TACCAATAAG TTTAAGGCGC 
GTTGTACGGT TTACACTTGC TCAAGTACAA ATGGTTATTC AAATTCCGCG 
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15151 GGGTGATGGT d||^CGCTTG CCTACTAAGG ACMTCA6GT GGAGCl^piA 
CCCACTACCA CAGCGCGAAC GGATGATTCC TGTTAGTCCA CCTCGACTTT 

15201 TACGAGTGGG TGGAGTTCAC GCTGCCCGAG GGCAACTACT CCGAGACCAT 
ATGCTCACCC ACCTCAAGTG CGACGGGCTC CCGTTGATGA GGCTCTGGTA 

15251 GACCATAGAC CTTATGAACA ACGCGATCGT GGAGCACTAC TTGAAAGTGG 
CTGGTATCTG GAATACTTGT TGCGCTAGCA CCTCGTGATG AACTTTCACC 

15301 GCAGACAGAA CGGGGTTCTG GAAAGCGACA TCGGGGTAAA GTTTGACACC 
CGTCTGTCTT GCCCCAAGAC CTTTCGCTGT AGCCCCATTT CAAACTGTGG 

15351 CGCAACTTCA GACTGGGGTT TGACCCCGTC ACTGGTCTTG TCATGCCTGG 
GCGTTGAAGT CT6ACCCCAA ACT6GGGCAG TGACCAGAAC AGTACGGACC 

15401 GGTATATACA AACGAAGCCT TCCATCCA6A CATCATTTTG CT6CCAGGAT 
CCATATATGT TTGCTTCGGA AGGTAGCTCT GTAGTAAAAC GACGGTCCTA 

15451 GCGGGGTGGA CTTCACCCAC AGCCGCCTGA GCAACTTGTT GGGCATCCGC 
CGCCCCACCT GAAGTGGGTG TCGGCGGACT CGTTGAACAA CCCGTAGGCG 

15501 AAGCGGCAAC CCTTCCAGGA GGGCTTTAGG ATCACCTACG ATGATCTGGA 
TTCGCCGTT6 GGAAGGTCCT CCCGAAATCC TAGTCGATGC TACTAGACCT 

15551 GGGTGGTAAC ATTCCCGCAC TGTTGCATOT GGACGCCTAC CAGGCGAGCT 
CCCACCATTG TAAGGGCGTG ACAACCTACA CCTGCGGATG GTCCGCTCGA 

15601 TGAAAGATGA CACCGAACAG GGCGGGGGTG GCGCAGGCGG CAGCAACAGC 
ACTTTCTACT GTGGCTTGTC CCGCCCCCAC CGCGTCCGCC GTCGTTGTCG 

15651 AGTGGCAGCG GCGCGGAAGA GAACTCCAAC GCGGCAGCCG CGGCAATGCA 
TCACCGTCGC CGCGCCTTCT CTTGAGGTTG CGCCGTCGGC GCCGTTACGT 

15701 GCCGGTGGAG GACATGAACG ATCATGCCAT TCGCGGCGAC ACCTTTGCCA 
CGGCCACCTC CTGTACTTGC TAGTACGGTA AGCGCCGCTG TGGAAACGGT 

157 51 CACGGGCTGA GGAGAAGCGC GCTGAGGCCG AAGCAGCGGC CGAAGCTGCC 
GTGCCCGACT CCTCTTCGCG CGACTCCGGC TTCGTCGCCG GCTTCGACGG 

15801 CCCCCCGCTG CGCAACCCGA GGTCGAGAAG CCTCAGAAGA AACCGGTGAT 
CGGG6GCGAC GC6TTGG6CT CCAGCTCTTC GGAGTCTTCT TTGGCCACTA 

15851 CAAACCCCTG ACAGAGGACA GCAAGAAACG CAGTTACAAC CTAATAAGCA 
GTTTGGGGAC TGTCTCCTGT CGTTCTTTGC GTCAATGTTG GATTATTCGT 

15901 ATGACAGCAC CTTCACCCAG TACCGCAGCT GGTACCTTGC ATACAACTAC 
TACTGTCGTG GAAGTGGGTC ATGGCGTCGA CCATGGAACG TATGTTGATG 

15951 GGCGACCCTC AGACCGGAAT CCGCTCATGG ACCCTGCTTT GCACTCCTGA 
CCCCTGGGAG TCTGGCCTTA GGCGAGTACC TGGGACGAAA CGTGAGGACT 

160.01 CGTAACCTGC. 6GCTCGGAGC AGGTCTACTG GTCGTTGCCA GACAT6ATGC 
GCATTGGACG CCGAGCCTCG TCCAGATGAC CAGCAACGGT CT6TACTACG 

16051 AAGACCCCGT GACCTTCCGC TCCACGCGCC AGATCAGCAA CTTTCCGGTG 
TTCTGGGGCA CTGGAJVGGCG AGGTGCGCGG TCTAGTCGTT GAAAGGCCAC 
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26101 GTGGGCGCCG ;^rGTTGCC CGTGCACTCC AAGAGCTTCmCSACGJjKA ~ 
CACCCGCGGC 'SRcAACGG GCACGTGAGG TTCTCGAA6A TGTTGC^It 

16151 GGCCGTCTAC TCCCAACTCA TCCGCCAGTT TACCTCTCTG ACCCACGTGT 
CC66CAGATG AG66TTCAGT AGGC6GTCAA ATGGAGAGAC T6G6TGCACA 

16201 TCAATCGCTT TCCCGAGAAC CA6ATTTT66 CGCGCCCGCC A6CCCCCACC 
AGTTA6CGAA AGGGCTCTTG GTCTAAAACC GCGCGGGCGG TCGGG6GTGG 

16251 ATCACCACCG TCAGTGAAAA CGTTCCTGCT CTCACAGATC ACGGGACGCT 
TAGTGGTGGC AGTCACTTTT GCAAGGACGA GAGTGTCTAG TGCCCTGCGA 

16301 ACCGCTGCGC AACAGCATCG GAGGAGTCCA GCGAGTGACC ATTACTGACG 
TG6CGACGCG TTGTCGTAGC CTCCTCAGGT CGCTCACTGG TAATGACTGC 

16351 CCAGACGCCG CACCTGCCCC TACGTTTACA AGGCCCTGGG CATAGTCTCG 
GGTCTGCGGC QTGGACOGGG AT6CAAATGT TCCGGGACCC GTATCAGAGC 

16401 CCGCGCGTCC TATCGAGCC6 CACTTTTTGA GCAAGCATGT CCATCCTTAT 
GGCGCGCAGG ATAGCTCGGC GTGAAAAACT CGTTCGTACA GGTAGGAATA 

16451 ATCGCCCAGC AATAACACAG GCTGGGGCCT GCGCTTCCCA AGCAAGATGT 
TAGCGGGTCG TTATTGTGTC CGACCCCGGA C6CGAAGGGT TCGTTCTACA 

16501 TTGGCGGGGC CAAGAAGCGC TCCGACCAAC ACCCAGTGCG CGTGCGCGGG 
AACCGCCCCG GTTCTTCGCG AGGCTGGTTG TGGGTCACGC GCACGCGCCC 

16551 CACTACCGCG CGCCCTGGGG CGCGCACAAA CGCGGCCGCA CTGGGCGCAC 
GTGATGGCGC GCGGGACCCC GCGCGTGTTT GCGCCGGCGT GACCCGCGTG 

16601 CACCGTCGAT GACGCCATCG ACGCGGTGGT GGAGGAGGCG CGCAACTACA 
GTGGCAGCTA CTGCGGTAGC TGCGCCACCA CCTCCTCCGC GCGTTGATGT 

16651 CGCCCACGCC GCCACCAGTG TCCACAGTGG ACGCGGCCAT TCAGACCGTG 
GCGGGTGCGG' CGGTGGTCAC AGGTGTCACC TGCGCCGGTA AGTCTGGCAC 

16701 GTGCGCGGAG CCCGGCGCTA TGCTAAAATG AAGAGACGGC GGAGGCGCGT 
CACGCGCCTC GGGCCGCGAT ACGATTTTAC TTCTCTGCCG CCTCCGCGCA 

16751 AGCACGTCGC CACCGCCGCC GACCCGGCAC TGCCGCCCAA CGCGCGGCGG 
TCGTGCAGCG GTGGCGGCGG CTGGGCCGTG ACGGC6GGTT GCGCGCCGCC 

16801 C6GCCCTGCT TAACCGCGCA CGTC6CACCG GCCGACGGGC GGCCATGCGG 
GCCGGGACGA ATTGGC6CGT GCAGCGTGGC CGGCTGCCCG CCGGTACGCC 

16B51 GCCGCTCGAA GGCTGGCCGC GGGTATTGTC ACTGTGCCCC CCAGGTCCAG 
CGGCGAGCTT CCGACCGGCG CCCATAACAG TGACACGGGG GGTCCAGGTC 

16901 GCGACGAGCG GCCGCCGCAG CAGCCGCX5GC CATTAGTGCT ATGACTCAGG 
CGCTGCTCGC CGGCGGCGTC CTCGGCGCCG GTAATCACGA TACTGAGTCC 

16951 GTCGCAGGGG CAACGTGTAT TGGGTGCGC6 ACTCGGTTA6 CGGCCTGCGC 
CAGCGTCCCC 6TTGCACATA ACCCACGCGC T6AGCCAATC GCCGGACGCG 

17001 GTGCCCGTGC GCACGCGCCC CCCGCGCAAC TAGATTGCAA 6AAAAAACTA 
CACGGGCACG CGTGGGCGGG GGGCGCGTTG ATCTAACGTT CTTTTTTGAT 
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17051 CTTAGACTCG l^pGTTGTA TGTATCCAGC GCCGGCCiCiCU UUCAAU^pti 
GAATCTGAGC A^RcAACAT ACATAGGTCG CCGCCGCCGC GCGTTGOTC 

17101 CTATGTCCAA GCGCAAAATC AAAGAAGAGA TGCTCCAGGT CATC6CGCCG 
GATACAGGTT CGCGTTTTAG TTTCTTCTCT ACGAGGTCCA GTA6CGCGGC 

17151 GAGATCTATG GCCCCCCGAA GAAGGAAGAG CAGGATTACA AGCCCCGAAA 
CTCTAGATAC CGGGGGGCTT CTTCCTTCTC GTCCTAATGT TCGGGGCTTT 



17201 GCTAAAGCGG GTCAAAAAGA AAAAGAAAGA TGATGATGAT GAACTTGACG 
CGATTTCGCC CAGTTTTTCT TTTTCTTTCT ACTACTACTA CTTGAACTGC 

17251 ACGAGGTGGA ACTGCTGCAC GCTACCGCGC CCAGGCGACG GGTACAGTGG 
TGCTCCACCT TGACGACGTG CGATGGCGCG GGTCCGCTGC CCATGTCACC 

17301 AAAGGTCQAC QCGTAAAACG TGTTTTGCGA CCCGGCACCA CCGTACTCTT 
TTTCCAGCTG CGCATTTTGC ACAAAACGCT GGGCCGTGGT GGCATCAGAA 

17351 TACGCCCGGT GAGC6CTCCA CCCGCACCTA CAAGCGCGTG TATGATGAGG 
ATGCGGGCCA CTC6CGAGGT GGGCGTGGAT GTTCGCGCAC ATACTACTCC 

17401 TGTACGGCGA CGAGGACCTG CTTGAGCAGG CCAACGAGCG CCTCGGGGAG 
ACATGCCGCT GCTCCTGGAC GAACTCGTCC GGTTGCTCGC GGAGCCCCTC 

17451 TTTGCCTAC6 GAAAGCGGCA TAAGGACATG CTGGCGTTGC CGCTGGACGA 
AAACGGATGC CTTTCGCCGT ATTCCTGTAC GACCGCAACG GCGACCTGCT 

17501 GGGCAACCCA ACACCTAGCC TAAAGCCCGT AACACTGCAG CAGGTGCTGC 
CCCGTTGGGT TGTGGATCGG ATTTCGGGCA TTGTGACGTC GTCCACGACG 

17551 CCGCGCTTGC ACCGTCCGAA GAAAAGCGCG GCCTAAAGCG CGAGTCTGGT 
GGCGCGAACG TGGCAGGCTT CTTTTCGCGC CGGATTTCGC 6CTCAGACCA 

.17601 GACTTGGCAC CCACCGTGCA GCTGATGGTA CCCAAGCGCC AGCGACTGGA 
CTGAACCGTG GGTGGCACGT CGACTACCAT GGGTTCGCGG TCGCTGACCT 

17651 AGATGTCTTG GAAAAAATGA CCGTGGAACC TGGGCTGGAG CCCGAGGTCC 
TCTACAGAAC CTTTTTTACT GGCACCTTGG ACCCGACCTC GGGCTCCAGG 

17701 GC6TGCGGCC AATCAAGCAG GTGGCGCCGG GACTGGGCGT GCAGACCGTG 
CGCACGCCGG TTAGTTCGTC CACCGCG6CC CTGACCCGCA CGTCTGGCAC 

17751 GACGTTCAGA TACCCACTAC CAGTAGCACC AGTATTGCCA CCGCCACA6A 
CTGCAAGTCT ATGGGTGATG GTCATCGTGG TCATAACGGT 6GCGGTGTCT 

17801 GGGCATGGAG ACACAAACGT CCCCGGTTGC CTCAGCGGTG GCGGATGCCG 
CCCGTACCTC TGTGTTTGCA CCGGCCAACG GAGTCGCCAC CGCCTACGGC 

17851 CGGTGCAGGC GGTCGCTGCG GCCGCGTCCA AGACCTCTAC GGAGGTGCAA 
GCCACGTCCG CCAGCGACGC CGGCGCAGGT TCTGGAGATG CXTTCCyiCGTT 

17901 ACGGACCCGT GGATGTTTCG CGTTTCAGCC CCCCGGCGCC CGCGCCGTTC 
TGCCTGGGCA CCTACAAAGC GCAAAGTCGG GGGGCCGCGG GCGCGGCAAG 

17951 GAGGAAGTAC GGCGCCGCCA GCGCGCTACT CCCCGAATAT GCCCTACATC 
CTCCTTCATG CCGCGGCGGT CGCGCGATGA CGGGCTTATA CGGGATGTAG 
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18001 CTTCCATTGC G^TACCCCC 0GCTATC6TG GCTACACOPfilLGeGGCQ^jcGA 
GAAGGTAACG d^P^TOGGGG CCGATAGCAC CGATGTGGAT GGCGGofl^T 

18051 AGACGAGCAA CTACCCGACG CCGAACCACC ACTGGAACCC GCCGCCGCCG 
TCTGCTCGTT 6ATGGGCTGC GGCTTGGTGG TGACCTTGGG CGGCGGCGGC 

18101 TCGCCGTCGC CAGCCCGTGC TGGCCCCGAT TTCCGTGCGC AGGGTGGCTC 
AGCGGCAGCG GTCGGGCACG ACCGGGGCTA AAGGCACGCG TCCCACCGAG 



18151 GCGAAGGAGG CAGGACCCT6 GTGCTGCCAA CAGCGCGCTA CCACCCCAGC 
CGCTTCCTCC 6TCCTGGGAC CACGAC66TT GTCGCGCGAT GGTGGGGTCG 

18201 ATCGTTTAWk AGCCGGTCTT TGTGGTTCTT GCAGATATGG CCCTCACCTG 
TAGCAAATTT TCGGCCAGAA ACACCAAGAA CGTCTATACC GGGAGTGGAC 

18251 CCGCCTCCGT TTCCCGGTGC CGGQATTCCG AGGAAGAATG CACCGTAGGA 
GGCGGAGGCA AAGGGCCACG GCCCTAAGGC TCCTTCTTAC GTG6CATCCT 

18301 GGGGCATGGC CGGCCACGGC CTGACGGGCG GCATGCGTCG TGCGCACCAC 
CCCCGTACCG GCCGGTGCCG GACTGCCCGC CGTACGCAGC ACGCGTGGTG 

18351 CGGCGGCGGC GCGCGTCGCA CCGTCGCATG CGCGGCGGTA TCCTGCCCCT 
GCCGCCGCCG CGCGCAGCGT GGCAGCGTAC GCGCCGCCAT AGGAC6GGGA 

18401 CCTTATTCCA CTGATCGCCG C66CGATTGG CGCCGTGCCC GGAATTGCAT 
GGAATAAGGT GACTAGCG6C GCCGCTAACC GCGGCACGGG CCTTAACGTA 

18451 CCGTGGCCTT GCAGGCGCAG AGACACTGAT TAAAAACAAG TTGCATGTGG 
GGCACCGGAA CGTCCGCGTC TCTGTGACTA ATTTTTGTTC T^CGTACACC 

IBS 01 AAAAATCAAA ATAAAAAGTC TGGACTCTCA CGCTCGCTTG GTCCTGTAAC 
TTTTTAGTTT TATTTTTCAG ACCTGASAGT GCGA6CGAAC CAG6ACATTG 

18551 TATTrTGTAG AATGGAAGAC ATCAACTTTG CGTCTCTGGC CCCGCGACAC 
ATAAAACATC TTACCTTCTG TAGTTGAAAC GCAGAGACCG GGGCGCTGTG 

18601 GGCTCGCGCC CGTTCATGGG AAACT6GCAA GATATCGGCA CCAGCAATAT 
CCGAGCGCGG GCAAGTACCC TTTGACCGTT CTATAGCCGT GGTCGTTATA 

18651 GAGCGGTGGC GCCTTCAGCT GGGGCTCGCT GTGGAGCQOC ATTAAAAATT 
CTCGCCACCG CGGTkAGTCGA CCCCGAGCGA CACCTCGCCG TAATTTTTAA 

18701 TCGGTTCCAC CGTTAAGAAC TATGGCAGGA AGGCCTGGAA CAGCAGCACA 
AGCCAAGGTG GCAATTCTTG ATACCGTCGT TCCGGACCTT GTCGTCGTGT 

18751 GGCCAGATGC TGAGGGATAA CTTGAAAGAG CAAAATTTCC AACAAAAGGT 
CCG6TCTACG ACTCCCTATT CAACTTTCTC GTTTTAAAGG TTGTTTTCCA 

18801 GGTAGATGGC CTG6CCTCTG GCATTAGCGG GGTGGTGGAC CTGGCCAACC 
CCATCTACCG C3ACCGGAGAC CGTAATCGCC CCACCACCTG GACCGGTTGG 

18851 AGGCAGTGCA AAATAAGATT AACAGTAAGC TTGATCCCCG CCCTCCCGTA 
TCCGTCACGT rPTATTCTAA TTGTCATTCG AACTAGGGGC GGGAGGGCAT 

18901 GAGGAGCCTC CACCQGCCGT GGAGACAGTG TCTCCAGAGG GGCGTGGCGA 
CTCCTCGGAG GTGGCCGGCA CCTCTGTCAC AGAGGTCTCC CCGCACCGCT 
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1B951 AAAGCGTCCG C^CCGACA GGGAAGAAAC TCTGGTGACG"CAAATA|^G 
TTTCGCAGGC G^BgGCTGT CCCTTCTTTG AGACCACTGC GTTTAtHSc 

19001 AGCCTCCCTC GTACGAGGAG GCACTAAAGC AAGGCCTGCC CACCACCCGT 
TCGGAGGGAG CATGCTCCTC CGTGATTTCG TTCCGGACGG GTGGTGGGCA 

19051 CCCATCGCGC CCATGGCTAC CGGAGTGCTG GGCCAGCACA cacccgtaac 

gggtagcgcg ggtaccgatg gcctcacgac ccggtcgtgt gtgggcattg 



19101 gctggacctg cctccccccg ccgacaccca gcagaaacct gtcctgccag. 

C6ACCT6GAC 6GAGGGGGGC GGCTGTGGGT CGTCTTTGGA CACGACGGTC 

19151 gcccgaccgc cgttgttgta acccgtccta gccgcgcgtc cctgcgccgc 
cgggctggcg gcaacaacat tgggcaggat cggcgcgcag ggacgcggcg 

19201 gccgccagcg gtccgcgatc gttgcggccc gtagccagtg gcaactggca 
cggcggtcgc caggcgctag caacgccggg. catcggtcac cgttgaccgt 

19251 aagcacactg aacagcatcg tgggtctggg ggtgcaatcc ctgaagcgcc 
ttcgtgtgac ttgtcgtagc acccagaccc ccacgttagg gacttcgcgg 

19301 gacgatgctt ctgatagcta acgtgtcgta tgtgtgtcat gtatgcgtcc 

CTGCTACGAA GACTATCGAT TGCACAGCAT ACACACAGTA CATACGCAGG 

19351 ATGTCGCCGC CAGAGGAGCT GCTGAGCCGC CGCGCGCCCG CTTTCCAAGA 
TACAGCGGCG GTCTCCTCGA CGACTCGGCG GCGCGCGGGC GAAAGGTTCT 

19401 TGGCTACCCC TTCGATGATG CCGCAGTGGT CTTACATGCA CATCTCGGGC 
ACCGATGGGG AAGCTACTAC GGCGTCACCA GAATGTACGT GTAGAGCCCG 

19451 CAGGACGCCT CGGAGTACCT GAGCCCCGGG CTGGTGCAGT TTGCCCGCGC 
CTCCTGCGGA GCCTCATGGA CTCGGGGCCC GACCACGTCA AACGGGCGCG 

19501 CACCGAGACG TACTTCAGCC TGAATAACAA GTTTAGAAAC CCCACGGTG6 
GTGGCTCTGC ATGAAGTCGG ACTTATTGTT CAAATCTTTG GGGTGCCACC 

19551 CGCCTACGCA CGACGTGACC ACAGACCGGT CCCAGCGTTT GACGCTGCGG 
GCGGATGCGT GCTGCACTGG TGTCTGGCCA GGGTCGCAAA CTGCGACGCC 

19501 TTCATCCCTG TGGACCGTGA GGATACTGCG TACTCGTACA AGGCGCGGTT 
AAGTAGGGAC ACCTGGCACT CCTATGACGC ATGAGCATGT TCCGCGCCAA 

19651 CACCCTAGCT GT6GGTGATA ACCGTGTGCT GGACATGGCT TCCACGTACT 
GTGGGATC6A CACCCACTAT TGGCACAC6A CCTGTACCGA AGGTGCATGA 

19701 TTGACATCCG CGGCGTGCTG GACAGGGGCC CTACTTTTAA 6CCCTACTCT 
AACTGTAGGC GCCOCACQAC CTGTCCCCGG GATGAAAATT CGGGATGAGA 

19751 GGCACTGCCT ACAACGCCCT GGCTCCCAAG GGTGCCCCAA ATCCTTGCGA 
CC6TGACGGA TGTTGCGGGA CCGAGGGTTC CCACGG66TT TAGGAACGCT 

19B01 ATGGGATGAA CCTGCTACTG CTCTTGAAAT AAACCTAGAA GAAGAGGACG 
TACCCTACTT CGACGATGAC GAGAACTTTA TTTGGATCTT CTTCTCCTGC 

19851 ATGACAACGA AGACGAAGTA GACGAGCAAG CTGAGCAGCA AAAAACTCAC 
TACTGTTGCT TCTGCTTCAT CTGCTCGTTC GACTCGTCGT TTTTTGAGTG 




75/144 



wo 02/22080 



PCT/USOl/28861 



19901 • GTATTTGGGC /.^KGCCTTA TTCTGGTATA AATATTACAA AGGAGq«T 
CATAAACCCG '^GCGGAAT AAGACCATAT TTATAATGTT TCCTCCOTTA 

19951 TCAAATAGGT GTCGAAGGTC AAACACCTAA ATATGCCGAT AAAACATTTC 
AGTTTATCCA CAGCTTCCAG TTTGTGGATT TATACGGCTA TTTTGTAAAG 

20001 AACCTGAACC TCAAATAGGA GAATCTCAGT GGTACGAAAC AGAAATTAAT 
TT6GACTTGG AGTTTATCCT CTTA6AGTCA CCATGCTTT6 TCTTTAATTA 

20051 CATGCAGCTG GGAGAGTCCT AAAAAAGACT ACCCCAATGA AACCATGTTA 
GTACGTCGAC CCTCTCAGGA TTTTTTCTGA TGGGGTTACT TTG6TACAAT 

20101 CGGTTCATAT GCAAAACCCA CAAATGAAAA TGGAGGGCAA 6GCATTCTTG 
GCCAAGTATA CGTTTTGGGT GTTTACTTTT ACCTCCCGTT CC6TAAGAAC 

20151 TAAAGCAACA AAATGGAAAG CTAGAAAGTC AAGTGGAAAT GCAATTTTTC 
ATTTCGTTGT TTTACCTTTC GATCTTTCAG TTCACCTTTA CGTTAAAAAG 

20201 TCAACTACTG AGGCAGCCGC AGGCAATGGT GATAACTTGA CTCCTAAAGT 
AGTTGATGAC TCCGTCGGCG TCC6TTACCA CTATTGAACT GAGGATTTCA 

20251 GGTATTGTAC AGTGAAGATG TAGATATAGA AACCCCAGAC ACTCATATTT 
CCATAACATG TCACTTCTAC ATCTATATCT TTGGGGTCTG TGAGTATAAA 

20301 CTTACATGCC CACTATTAAG GAAGGTAACT CACGAGAACT AATGGGCCAA 
GAATGTACGG GTGATAATTC CTTCCATTGA GTGCTCTTGA TTACCCGGTT 

20351 CAATCTATGC CCAACAGGCC TAATTACATT GCTTTTAGGG ACAATTTTAT 
GTTAGATACG GGTTGTCCGG ATTAATGTAA CGAAAATCCC TGTTAAAATA 

20401 TGGTCTAATG TATTACAACA GCACGGGTAA TATGGGTGTT CTGGCGGGCC 
ACCAGATTAC ATAATGTTGT CGTGCCCATT ATACCCACAA GACCGCCCGG 

20451 AAGCATCGCA GTT6AATGCT GTTGTAGATT TGCAAGACAG AAACACAGAG 
TTCGTAGCGT CAACTTACGA CAACATCTAA ACGTTCTGTC TTTGTGTCTC 

20501 CTTTCATACC AGCTTTTGCT TGATTCCATT GGTGATAGAA CCAGGTACTT 
GAAAGTATGG TC6AAAACGA ACTAAGGTAA CCACTATCTT GGTCCATGAA 

20551 TTCTATGTGG AATCAGGCTG TTGACAGCTA TGATCCAGAT GTTAGAATTA 
AAGATACACC TTAGTCCGAC AACTGTCGAT ACTAGGTCTA CAATCTTAAT 

20601 TTGAAAATCA TG6AACTGAA GATGAACTTP CAAATTACTG CTTTCCACTG 
AACTTTTAGT ACCTTGACTT CTACTTGAAG GTTTAATGAC GAAAGGTGAC 

20651 GGAGGTGTGA TTAATACAGA GACTCTTACC AAGGTAAAAC CTAAAACAGG 
CCTCCACACT AATTATGTCT CTGAGAATGG TTCCATTTTG GATTTTGTCC 

20*701 TCAGGA7AAT GGATGG6AAA AAGATGCTAC AGAATTTTCA GATAAAAATG 
AGTCCTTTTA CCTACCCTTT TTCTACGATG TCTTAAAAGT CTATTTTTAC 

20751 AAATAAGAGT TGGAAATAAT TTTGCCATX3G AAATCAATCT AAATGCCAAC 
TTTATTCTCA ACCTTTATTA AAACGGTACC TTTAGTTAGA TTTACGGTTG 

20801 CTGTGGAGAA ATTTCCTGTA CTCCAACATA GCGCTGTATT TGCCCGACAA 
GACACCTCTT TAAAGGACAT GAGGTTGTAT CGCGACATAA ACGGGCTGTT 
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20851 GCTAAAGTAC. M|CCTTCCA ACGTAAAAAT TTCTGAT;eAe-t:C;sASt^|tT 
CGATTTCATG tWtcAAGGT TGCATTTTTA AAGACTATTG GGTTTd^PA 

20901 ACGACTACAT GAACAA6CGA GTGGTGGCTC CCG6GCTAGT GGACT6CTAC 
TGCTGATGTA CTTGTTCGCT CACCACCGAG GGCCCGATCA CCTGACGATG 

20951 ATTAACCTTG GAGCACGCTG GTCCCTTGAC TATATGGACA ACGTCAACCC 
TAATTGGAAC CTCGTGC6AC CAGGGAACTG ATATACCTGT TGCAGTTGGG 

21001 ATTTAACCAC CACCGCAATG CTGGCCTGCG CTACCGCTCA ATGTTGCTGG 
TAAATTGGTG GTGGCGTTAC GACCGGACGC GATGGCGAGT TACAACGACC 

21051 GCAATGGTCG CTATGTGCCC TTCCACATCC AGGTGCCTCA GAAGTTCTTT 
CGTTACCAGC GATACACGGG AAGGTGTAGG TCCACGGAGT CTTCAAGAAA 

21X01 GCCATTAAAA ACCTCCTTCT CCTGCCGGGC TCATACACCT ACGAGTGGAA 
CGGTAATTTT TGGAGGAAGA GGACGGCCC6 AGTATGT6GA T6CTCACCTT 

21151 CTTCA6GAAG GAT6TTAACA TGGTTCTGCA GAGCTCCCTA GGAAATGACD 
GAAGTCCTTC CTACAATTGT ACCAAGAC6T CTCGAGGGAT CCTTTACTGG 

21201 TAAGGGTTGA CGGAGCCAGC ATTAACTTTG ATAGCATTTG CCTTTACGCC 
ATTCCCAACT GCCTCGGTCG TAATTCAAAC TATCGTAAAC GGAAATGCGG 

21251 ACCTTCTTCC CCATGGCCCA CAACACCGCC TCCACGCTTG AGGCCATGCT 
TGGAAGAAGG GGTACCGGGT GTTGTGGCGG AGGTGCGAAC TCCGGTACGA 

21301 TAGAAACGAC ACCAACGACC AGTCCTTTAA CGACTATCTC TCCGCCGCCA 
ATCTTTGCTG TGGTTGCTGG TCAGGAAATT GCTGATAGAG AGGCGGCGGT 

21351 ACATGCTCTA CCCTATACCC GCCAACGCTA CCAACGTGCC CATATCCATC 
TGTACGAGAT GGGATATGGG CGGTTGCGAT GGTTGCACGG GTATAGGTAG 

21401 CCCTCCCGCA ACTGGGCGGC TTTCCGCGGC TGGGCCTTCA CGCGCCTTAA 
GGGAGGGCGT TGACCCGCCG AAAGGCGCCG ACCCGGAAGT GCGCGGAATT 

21451 GACTAAGGAA ACCCCATCAC TGGGCTCGGG CTACGACCCT TATTACACCT 
CTGATTCCTT TGGGGTAGTG ACCCGAGCCC GATGCTGGGA ATAATGTGGA 

21501 ACTCTGGCTC TATACCCTAC CTAGATGGAA CCTTTTACCT CAACCACACC 
TGAGACCGAG ATATGGGATG GATCTACCTT GGAAAATGGA GTT6GTGTGG 

21551 TTTAAGAAG6 TGGCCATTAC CTTTGACTCT TCTGTCAGCT GGCCTGGCAA 
AAATTCTTCC ACCGGTAATG 6AAACTGAGA AGACAGTCGA CCGGACCGTT 

21601 TGACCGCCTG CTTACCCCCA ACGAGTTTGA AATTAAGCGC TCAGTTGACG 
ACTGGCGGAC GAATGGGGGT TGCTCAAACT TTAATTCGCG AGTCAACTGC 

21651 GGGAGGGTTA CAACGTTGCC CAGTGTAACA TGACCAAAGA CTGGTTCCTG 
CCCTCCCAAT GTTGCAACGG 6TCACATTGT ACTGGTTTCT GACCAAGGAC 

21701 GTACAAATGC TAGCTAACTA TAACATTGGC TACCAGGGCT TCTATATCCC 
CATGTTTACG ATCGATTGAT ATTGTAACCG ATGGTCCCGA AGATATAGGG 

21751 AGAGAGCTAC AAGGACCGCA TGTACTCCTT CTTTAGAAAC TTCCAGCCCA 
TCTCTCGATG TTCCTGGCGT ACATGAGGAA GAAATCTTTG AAGGTCGGGT 
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21801 TGAGCCGTCA (^GGTGGAT GATACTAMT ACAAGGAOTA*<:eWiC*g»G 
ACTCGGCAGT dBcACCTA CTATCATTTA TGTTCCTGAT GG.TTGT^C 



ACTCGGCACT 

21851 GGCATCCTAC ACCAACACAA CAACTCTGGA TTTGTTGGCT ACCTTGCCCC 
CCGTAGGATG TGGTTCTGTT GTTGAGACCT AAACAACCGA TGGAACGGGG 

21901 CACCATGCGC GAAGGACAGG CCTACCCTGC TAACTTCCCC TATCCGCTTA 
GTGGTACGCG CTTCCTGTCC GGATGGGACG ATTGAAGGCG ATAGGCGAAT 

21951 TAGGCAAGAC CGCAGTTGAC AGCATTACCC AGAAAAAGTT TCTTTGCGAT 
ATCCGTTCTG GCGTCAACTG TCGTAATGGG TCTTTTTCAA AGAAACGCTA 

22001 CGCACCCTPT GGCGCATCCC ATTCTCCAGT AACTTTATGT CCATGGGCGC 
GCGTGGGAAA CCGCGTAGGG TAAGAGGTCA TTGAAATACA GGTACCCGCG 

22051 ACTCACAGAC CTGGGCCAAA ACCTTCTCTA CGCCAACTCC GCCCACGCGC 
TCAGTGTCTG GACCCGGTTT TCGAAGAGAT GC6GTTGAGG CGGGTGCGCG 

22101 TAGACATGAC rTTTGAGGTG GATCCCATGG AC6AGCCCAC CCTTCTTTAT 
ATCTGTACTG AAAACTCCAC CTAGGGTACC TGCTCGGGTG GGAA6AAATA 

22151 GrrTTGTTTG AAGTCTTTGA CGTGGTCCGT GTQCACCAGC CGCACCGCGG 
CAAAACAAAC TTCAGAAACT GCACCAGGCA CACGTGGTCG GCGTGGCGCC 

22201 CGTCATCGAA ACCGTGTACC TGCGCACGCC CTTCTCGGCC GGCAACGCCA 
GCAGTAGCTT TGGCACATGG ACGCGTGCGG GAAGAGCCGG CCGTTGCGGT 

22251 CAACATAAAG AAGCAAGCAA CATCAACAAC AGCTGCCGCC ATGGGCTCCA 
GTTGTATTTC TTCGTTCGTT GTAGTTGTTG TCGACGGCGG TACCCGAGGT 

22301 GTGAGCAGGA ACTGAAAGCC ATTGTCAAAG ATCTTGGTTG TGGGCCATAT 
CACTCGTCCT TGACTTTCGG TAACAGTTTC TAGAACCAAC ACCCGGTATA 

22351 TTTTTGGGCA CCTATGACAA GCGCTTTCCA GGCTTTGTTT CTCCACACAA 
AAAAACCCGT GGATACTGTT CGCGAAAG6T CCGAAACAAA GAGGTGTGTT 

22401 GCTCGCCTGC GCCATAGTCA ATACGGCCGG TCGCGAGACT GGGGGCGTAC 
CGAGCGGACG CGGTATCAGT TATGCCGGCC AGCGCTCTGA CCCCCGCATG 

22451 ACTGGATGGC CTTTGCCTCG AACCCGCACT CAAAAACATG CTACCTCTTT 
T6ACCTACCG OAAACGGACC TTGGGCGTGA GTTTTTGTAC GATGGAOAAA 

22501 GAGCCCTTTC GCTTTTCTCA CCAGCGACTC AAGCAGGTTT ACCAGTTTGA 
CTC6GGAAAC C6AAAAGACT GGTCGCTGAG TTCGTCCAAA TGGTCAAACT 

22551 GTACGAGTCA CTCCTGCGCC GTAGCGCXaiT TGCTTCTTCC CCCGACCGCT 
CATGCTCAGT GAGGACGCGG CATCGCGGTA ACGAAGAAGG GGGCTGGCGA 

22601 GTATAACGCT GGAAAAGTCC ACCCAAAGCG TACAGGGGCC CAACTCGGCC 
StATTGCQA CCTTTTCy^QG TG6GTTTCGC ATGTCCCCCG GTTGACCCGG 

22651 GCCTGTGGAC TATTCTGCTG CATGTTTCTC CACGCCTTTG CCAACTGGCC 
CGGACACCTG ATAAGACGAC 6TACAAAGAG GTGC6GAAAC GGTTGACCGG 

22701 CCAAACTCCC ATCGATCACA ACCCCACCAT GAACCTTATT ACCGGGGTAC 
GGTTTCAGGG TftCCTAGTGT -roGGGrGGTA CTTGGAATAA TGGCCCCATG 
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22751 CCAACTCCAT 6CTCAACAGT CCCCAOGTAC AGCCCACqj^-Gf^JPQgp^c • 
GGTTGAGGTA q^||TTGTCA GGGGTCCATG TCGGGTGGGA CGCAGC^KG * 

22801 CAGGAACAGC TCTACAGCTT CCTGGAGCGC CACTCGCCCT ACTTCCGCAG 
GTCCTTGrCG AGATGTCGAA GGACCTCGCG GTGAGCGGGA TGAAGGCGTC 

22851 CCACAGTGCG CAGATTAGGA GCGCCACTTC TTTTTGTCAC TTGAAAAACA 
GGTGTCACGC GTCTAATCCT CGCGGTGAAG AAAAACAGTG AACTTTTTGT 

22901 TGTAAAAATA ATGTACTAGA GACACTTTCA ATAAAGGCAA ATGCTTTTAT 
ACATTTTTAT TACATGATCT CTGTGAAAGT TATTTCCGTT TACGAAAATA 

22951 TTGTACACTC TCGGGTGATT ATTTACCCCC ACCCTTGCCG TCTGCGCCGT 
AACATGTGAG AGCCCACTAA TAAATG6GGG TGGGAACG6C AGACGCGGCA 

23001 TTAAAAATCA AAGGGGTTCT GCCOCQCATC GCTATGCGCC ACTGGCAGGG 
AATTTTTAGT TTCCCCAAGA CGGCGCGTAG CGATACGCGG TGACCGTCCC 

23051 ACACGTTGCG ATACTGGTGT TTAGTGCTCC ACTTAAACTC AGGCACAACC 
TGTGCAACGC TATGACCACA AATCACGAGG TGAATTTGAG TCCGTGTTGG 

23101 ATCCGCGGCA GCTCGGTGAA GTTTTCACTC CACAGGCTGC GCACCATCAC 
TAGGCGCCGT CGAGCCACTT CAAAAGTGAG GTGTCCGACG CGTGGTAGTG 

23151 CAACGCGTTT AGCAGGTCGG GCGCCGATAT CTTGAAGTCG CAGTTG6GGC 
GTTGCGCAAA TC6TCCAGCC CGCGGCTATA GAACTTCAGC GTCAACCCCG 

23201 CTCCGCCCTG CGCGCGCGAG TTGCGATACA CAGGGTTGCA GCACTGGAAC 
GAGGCGGGAC GC6CGCGCTC AACGCTATGT GTCCCAACGT CGTCACCTTG 

23251 ACTATCAGCG CCGGGTGGTG CACGCTGGCC AGCACGCTCT TGTCGGAGAT 
TGATAGTCGC GGCCCACCAC GTGCGACCGG TCGTGCGAGA ACAGCCTCTA 

23301 CAGATCCGCG TCCAGGTCCT CCGCGTTGCT CAGGGCGAAC GGAGTCAACT 
GTCTAGGCGC AGGTCCAQGA GGCGCAACGA GTCCCGCTTG CCTCAGTTGA 

23351 TTGGTAGCTG CCTTCCCAAA AAGGGCGCGT GCCCAGGCTT TGAGTTGCAC 
AACCATCGAC GGAAGGGTTT TTCCCGCGCA CGGGTCCGAA ACTCAACGTG 

23401 TCGCACCGTA GTGGCATCAA AAGGTGACCG TGCCCGGTCT QGGCGTTAGG 
AGCGTGGCAT CACCGTAGTT TTCCACTGGC ACGGGCCAGA CCCGCAATCC 

23451 ATACAGCGCC TGCATAAAA6 CCTTGATCTG CTTAAAAGCC ACCTGAGCCT 
TATGTCGCGG ACGTATTTTC GGAACTAGAC GAATTTTCGG TGGACTCGGA 

23501 TTGCGCCTTC AGAGAAGAAC ATGCCGCAAG ACTTGCCGGA AAACTGATTG 
AACGCGGAAG TCTCTTCTTG TACGGCGTTC TGAACGGCCT TTTGACTAAC 

23551 GCCGGACAGG CCGCGTCGTG CACGCAGCAC CTTGCGTCGG TCTTGGAGAT 
CGGCCTGTCC G6CGCAGCAC GTGCGTCGTG GAACGCAGCC ACAACCTCTA 

23601 CTGCACCACA TTTC6GCCCC ACCGGTTCTT CACGATCTTG GCCTTGCTAG 
6ACGTGGTGT AAAGCCGGGG TGGCCAAGAA GTGCTAGAAC CGGAACGATC 

23651 ACTGCTCCTT CAGC6CGCGC TGCCC6TTTT CGCTCGTCAC ATCCATTTCA 
TGACGAGGAA 6TCGCGCGCG ACGGGCAAAA GCGAGCAGTG TAGGTAAAGT 
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23701 ATCACGTGCT cIRaTTTAT CATAATQCTT CCGTGTAGAC ACTTAaWc 
TAGTGCACGA GGAATAAATA GTATTACGAA GGCACATCTG TGAATTCCAG 

23751 GCCTTCGATC TCAGCGCAGC GGTGCAGCCA CAACGCGCAG CCCGTGGGCT 
CGGAAGCTAG AGTCGCGTCG CCACGTCGGT GTTGCGCGTC GGGCACCCGA 

23801 CGTGATGCTT GTAGGTCACC TCTGCAAACG ACTGCAGGTA CGCCTGCAGG 
GCACTACGAA CATCCAGTGG AGACGTTTGC TGACGTCCAT GCG6ACGTCC 

23851 AATCGCCCCA TCATCGTCAC AAAG6TCTT6 TTGCTGGTQA AGGTCAGCTG 
TTAGCGGGGT AGTAGCAGTG TTTCCAGAAC AACGACCACT TCCAGTCGAC 



23901 CAACCCGCGG TGCTCCTCGT TCAGCCAGGT CTTGCATAC6 GCCGCCAGAG 
GTTGGGCGCC ACGAGGAGCA AGTCCGTCCA GAACGTATGC CG6CGGTCTC 

23951 CTTCCACTTG GTCAGGCAGT AGTTTGAAGT TCGCCTTTAG ATCGTTATCC 
GAAGGTGAAC CAGTCCGTCA TCAAACTTCA AGCGGAAATC TAGCAATAGG 

24001 ACGTGGTACT TGTCCATCAG CGCGCGCGCA GCCTCCATGC CCTTCTCCCA 
TGCACCATGA ACAGGTAGTC GCGCGCGCGT CG6AGGTACG GGAAGAGGGT 

24051 CGCAGACACG ATCGGCACAC TCAGCGGGTT CATCACCGTA ATTTCACTTT 
GCGTCTGTGC TAGCCGTGTG AGTCGCCCAA GTAGTGGCAT TAAAGTGAAA 

24101 CCGCTTCGCT GGGCTCTTCC TCTTCCTCTT 6CGTCCGCAT ACCACGCGCC 
GGCGAAGCGA CCCGAGAAGG AGAAGGAGAA CGCAGGCGTA TGGTGCGCGG 

24151 ACTGGGTCGT CTTCATTCAG CCGCCGCACT GTGCGCTTAC CTCCTTTGCC 
TGACCCAGCA GAAGTAAGTC GGCGGCGTGA CACGCGAATG 6AGGAAACGG 

24201 ATGCTTGATT AGCACCGGTG GGTTGCTGAA ACCCACCATT TGTAGCGCCA 
TACGAACTAA TCGTGGCCAC CCAACGACTT TGGGTGGTAA ACATCGCGGT 

24251 CATCTTCTCT TTCTTCCTCG CTGTCCACGA TTACCTCTGG TGATGGCGGG 
GTAGAAGAGA AAGAAGGAGC GACAGGT6CT AATGGAGACC ACTACCGCCC 

24301 CGCTCGGGCT TGGGAGAAGG GCGCTTCTTT TTCTTCTTGG 6CGCAATGGC 
GCGAGCCCGA ACCCTCTTCC CGCGAAGAAA AAGAAGAACC CGCGTTACCG 

24351 CAAATCCGCC GCCGAGGTCG ATGGCCGCGG GCTGGGTGTG CGCGGCACCA 
GTTTAGGCGG CGGCTCCAGC TACCGGCGCC CGACCCACAC GCGCCGTGGT 

24401 GCGCGTCTTG TGATGAGTCT TCCTCGTCCT CGGACTCGAT ACGCCGCCTC 
CGCGCAGAAC ACTACTCAGA AGGAGCAGGA GCCTGAGCTA TCCGGCG6AG 

24451 ATCCGCTTTT TTGGGGGCGC CCGGGGAGGC GGCGGCGACG GGGACGGGGA 
TAGGCGAAAA AACCCCCGCG GGCCCCTCCG CCGCCGCTGC CCCTGCCCCT 

24501 CGACACGTCC TCCATGGTTG GGGGACGTC6 CGCCGCACCG CGTCCGCGCT 
GCTGTGCAGG AGGTACCAAC CCCCTGCAGC GCGGCGTGGC GCAGGCGCGA 

24551 CGGGGGTGGT TTCGCGCT6C TCCTCTTCCC GACTGOCCAT TTCCTTCTCC 
GCCCCCACCA AAGC6CGACG AGGAGAAGGG CTGACCGGTA AAGGAAGAGG 

24601 TATAGGCAGA AAAAGATCAT GGAGTCAGTC GAGAA6AAGG ACAGCCTAAC 
ATATCCGTCT TTTTCTAGTA CCTCAGTCAG CTCTTCTTCC TGTCGGATTG 
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246B1 
24701 
24751 
24601 

24651 
24901 
24951 
25001 
25051 
25101 
25151 
25201 
25251 
25301 
25351 
25401 
25451 
25501 
25551 



GCGGGGGAGA cWaAGCGGT GGTGGCGGAG GTGGCTACGG' CGGTTGcRg 

ctaccaccot ccxtcgtcgag gcacccccgc ttgaggagga ggaagtgatt 
gatggtggaa ggggcagctc cgtgggggcg aactcctcct ccttcactaa 

atcgagcagg acccaggttt tgtaagcgaa gaggacgagg accgctcagt 
tagctcgtcc tgggtccaaa acattcgctt ctgctgctcc tggcgagtca 

ACCAACAGAG GATAAAAAGC AAGACCAGGA CAACGCAGAG GCAAACGAGG 
TGGTTGTCTC CTATTTTTCG TTCTGGTCCT GTTGCGTCTC CGTTTGCTCC 



AACAA6TCGG GCGGGGGGAC GAAAGGCAT6 GCGACTACCT AGATGTGGGA 
TTGTTCAGCC CGCCCCCCTG CTTTCCGTAC CGCTGATGGA TCTACACCCT 

GACGACGTGC TGTTGAAGCA TCTGCAGCGC CAGTGCGCCA TTATCTGCGA 
CTGCTGCACG ACAACTTCGT AGACGTCGCQ GTCACGCGGT AATAGACGCT 

CGCGTTGCAA GAGCGCAGCG ATGTGCCCCT CGCCATAGCG GATGTCAGCC 
GCGCAACGTT CTCGCGTCGC TACACGGGGA GCGGTATCGC CTACAGTCGG 

TTGCCTACGA ACGCCACCTA TTCTCACCGC GCGTACCCCC CAAACGCCAA 
AACGGATGCT TGCGGTGGAT AAGAGTGGCG CGCATGGGGG GTTTGCGGTT 

GAAAACGGCA CATGCGAGCC CAACCCGCGC CTCAACTTCT ACCCCGTATT 
CTTTTGCCGT GTACGCTCGG GTTGGGCGC6 GAGTTGAAGA TGGGGCATAA 

TGCCGTGCCA GAGGTGCTTG CCACCTATCA CATCTTTTTC CAAAACTGCA 
ACGGCACGGT CTCCACGAAC GGTGGATAGT GTAGAAAAAG GTTTTGACGT 

AGATACCCCr ATCCTGCCGT GCCAACCGCA GCCGAGCGGA CAAGCAGCTG 
TCTATGGGGA TAGGACGGCA CGGTTGGCGT CGGCTCGCCT GTTCGTCGAC 

GCCTTGCGGC AGGGCGCTGT CATACCTGAT ATCGCCTCGC TCAACGAAGT 
CGGAACGCCG TCCCGCGACA GTATGGACTA TAGCGGAGCG AGTTGCTTCA 

GCCAAAAATC TTTGAGGGTC TTGGACGCGA CGAGAAGCGC GCGGCAMCG 
CGGTTTTTAG AAACTCCCAG AACCTGCGCT GCTCTTCGCG CGCCGTTTGC 

CTCTGCAACA GGAAAACAGC GAAAATGAAA GTCACTCTGG AGTGTTGGTG 
GAGACGTTGT CCTTTTGTCG CTTTTACTTT CAGTGA6ACC TCACAACCAC 

GAACTCGAGG GTGACAACGC GCGCCTAGC'c GTACTAAAAC GCAGCATCGA 
CTTGAGCTCC CACTGTTGCG CGCGGATCGG CATGATTTTG CGTCGTAGCT 

GGTCACCCAC TTTGCCTACC C6GCACTTAA CCTACCCCCC AAGGTCATGA 
CCAGTGGGTG AAACGGATGG 6CCGTGAATT GGATGOGGGQ TTCCAGTACT 

GCACAGTCAT GAGTGAGCTG ATCGTGCGCC GTGCGCAGCC CCTGGAGAGG 
CGTGTCAGTA CTCACTCGAC TAGCAC6C6G CACGCGTCGG GGACCTCTCC 

GATGCAAATT T6CAA6AACA AACAGAGGAG GGCCTACCCG CA6TTGGCGA 
CTACGTTTAA ACGrTTCTTGT TTGTCTCCTC CCGGATGGGC GTCAACCGCT 

CGAGCAGCTA GCGCGCTGGC TTCAAACGCG CGAGCCTGCC GACTTGGAGG 
GCTCGTC6AT C6CGCGACCG AAGTTTQCCC GCTCGGACGG CTGAACCTCC 
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25601 AGCGACGCAA /^p^TGATG GCCGCAGTGC TCGTTACCGT GGAGCIM^G 
TCGCTGCGTT TGATTACTAC CGGCGTCACG AGCAATGGCA CCTCGAACTC , 

25652 TGCATGCAGC GGTTCTTTGC TGACCCGGAG ATGCAGCGCA AGCTAGAGGA 
ACGTACGTCG CCAAGAAAC6" ACTGGGCCTC TACGTCGCGT TCGATCTCCT 

r 

25701 AACATTGCAC TACACCTTTC GACAGGGCTA CGTACGCCAG GCCTGCAAGA 
TTGTAACGTG ATGTGGAAAG CTGTCCCGAT GCATGCGGTC CGGACGTTCT 

25751 TCTCCAACGT GGAGCTCTGC AACCTGGTCT CCTACCTTGG AATTTTGCAC 
AGAGGTTGCA CCTCGAGACG TTGGACCAGA GGATGGAACC TTAAAACGTG 



25B01 GAAAACCGCC TTGGGCAAAA CGTGCTTCAT TCCACGCTCA AGGGCGAGGC 
CTTTTGGCGG AACCCGTTTT GCACGAAGTA AGGTGCGAGT TCCCGCTCCG 

25851 GCGCCGCGAC TACGTCC6CG ACTGCGTTTA CTTATTTCTA TGCTACACCT 
CGCGGCGCTG ATGCAGGCGC TGACGCAAAT GAATAAA6AT ACGATGTGGA 

25901 GGCAGACGGC CATGGGCGTT TGGCAGCAGT GCTTGGAGGA 6TGCAACCTC 
CCGTCTGCCG GTACCCGCAA ACCGTCGTCA CGAACCTCCT CACGTTGGAG 

25951 AAGGAGCTGC AGAAACTGCT AAAGCAAAAC TTGAAGGACC TATGGACGGC 
TTCCTCGACG TCTTTGACGA TTTCGTTTTG AACTTCCTGG ATACCTGCCG 

26001 CTTCAACGAG CGCTCCGTGG CCGCGCACCT GGCGGACATC ATTTTCCCCG 
GAAGTTGCTC GCGA6GCACC GGCGCGTGGA CC6CCTGTAG TAAAAGGGGC 

26051 AACGCCTGCT TAAAACCCTG CAACAGGGTC T6CCAGACTT CACCAGTCAA 
TTGCGGACGA ATTTTGGGAC GTTGTCCCAG ACGGTCTGAA GTGGTCAGTT 

26101 AGCATGTTGC AGAACTTTAG GAACTTTATC CTAGAGCGCT CAGGAATCTT 
TCGTACAACG TCTTGAAATC CTTGAAATAG GATCTCGCGA GTCCTTAGAA 

26151 GCCCGCCACC TGCTGTGCAC TTCCTAGCGA CTTTGTGCCC ATTAAGTACC 
CGGGCGGTGG ACGACACGTG AAGGATCGCT GAAACACGGG TAATTCATGG 

26201 GCGAATGCCC TCCGCCGCTT TGGGGCCACT GCTACCTTCT GCAGCTAGCC 
CGCTTACGGG AGGCGGCGAA ACCCCGGTGA CGATGGAAGA CGTCGATCGG 

26251 AACTACCTTG CCTACCACTC TGACATAATG GAAGACGTGA GCGGTGACGG 
TTGATGGAAC GGATGGTGAG ACTCTATTAC CTTCTGCACT CGCCACTGCC 

26301 TCTACTGGAG TGTCACTGTC GCTGCAACCT ATGCACCCCG CACCGCTCCC 
AGATGACCTC ACAGTGACAG CGACGTTGGA TACGTGGGGC GTGGCGAGGG 

26351 TGGTTTGCAA TTCGCAGCTG CTTAACGAAA GTCAAATTAT CGGTACCTTT 
ACCAAACGTT AAGCGTCGAC GAATTGCTTT CAGTTTAATA GCCATGGAAA 

26401 GAGCTGCAGG GTCCCTCGCC TGACGAAAAG TCCGCGGCTC CGGGGTTGAA 
CTCGACGTCC CAGGGAGCGG ACTGCTTTTC AGGCGCCGAG GCCCCAACTT 

26451 ACTCACTCCG GGGCTGTGGA CGTC6GCTTA CCTTCGCAAA TTTGTACCTG 
TGAGTGAGGC CCCGACACCT GCAGCCGAAT GGAAGCGTTT AAACATGGAC 

26501 AGGACTACCA CGCCCACGAG ATTAGGTTCT ACGAAGACCA ATCCCGCCCG 
TCCTGATGGT GCGGGTGCTC TAATCCAAGA TGCTTCTG6T TAGGGCGGGC 
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26551 
26601 
26651 
26701 

26751 
26B01 
26B51 
26901 
26951 
27001 
27051 
27101 
27151 
27201 
27251 
27301 
27351 
27401 
27451 



CCTAATGCGG AWtTTACCGC CTGCGTCATT ACCCAGGGCC ACATTCIRG 
6GATTACGCC TCGAATGGCG GACGCAGTAA TGGGTCCCGG TCrAAGAACC 

CCAATTGCAA GCCATCAACA AAGCCCGCCA AGAGTTTCTG CTACGAAAGG 
GGTTAACGTT CGGTAGTTGT TTCGGGCGGT TCTCAAAGAC GATGCTTTCC 

GACGGGGGGT TTACTTGGAC CCCCAGTCCG GCGAGGAGCT CAACCCAATC 
CTGCCCCCCA AATGAACCTG GGGGTCAGGC CGCTCCTCGA GTTGGGTTAG 

CCCCCGCCGC CGCAGCCCTA TCAGCAGCAG CCGCGGGCCC TTGCTTCCCA 
GGGGGCGGCG GCGTCGGGAT AGTCGTCGTC GGCGCCCG6G AACGAAGGGT 



GGATGGCACC CAAAAAGAAG CTGCAGCT6C C6CCGCCACC CACGGACGAG 
CCTACCGTGG GTTrTTCTTC GACGTCGACG GCGGCGGTGG GTGCCTGCTC 

GAGGAATACT GGGACAGTCA GGCAGAGGAG GTTTTGGACG AGGAGGAGGA 
CTCCTTATGA CCCTGTCAGT CCGTCTCCTC CAAAACCTGC TCCTCCTCCT 

GGACATGATG GAAGACTGGG AGAGCCTAGA CGAGGAAGCT TCCGAGGTCG 
CCTGTACTAC CTTCTGACCC TCTCQGATCT GCTCCTTCGA AGGCTCCAGC 

AAGAGGTGTC AGACGAAACA CCGTCACCCT CGGTCGCATT CCCCTCGCCG 
TTCTCCACAG TCTGCTTTGT GGCAGTG6GA GCCAGCGTAA GGGGAGCGGC 

GCGCCCCAGA AATCGGCAAC CGGTTCCAGC ATGGCTACAA CCTCCGCTCC 
CGCGGGGTCT TTAGCCGTTG GCCAAGGTCG TACCGATGTT GGAGGCGAGG 

TCAGGCGCCG CCGGCACTGC CCGTTCGCCG ACCCAACCGT AGATGGGACA 
AGTCCGCGGC GGCCGTGACG GGCAAGC6GC TGGGTTGGCA TCTACCCTGT 

CCACTGGAAC CAGGGCCGGT AAGTCCAAGC AGCCGCCGCC GTTAGCCCAA 
GGTGACCTTG GTCCCGGCCA TTCAGGTTCG TCGGCGGCGG CAATCGGGTT 

GAGCAACAAC AGCGCCAAGG CTACCGCTCA TGGCGCGGGC ACAAGAACGC 
CTCGTTGTTG TCGCGGTTCC GATGGCGAGT ACCGCGCCCG TGTTCTTGCG 

CATAGTTGCT TGCTTGCAAG ACTGTGGGGG CAACATCTCC TTCGCCCGCC 
GTATCAAC6A ACGAACGTTC TGACACCCCC GTTGTAGAGG AAGCGGGCGG 

GCTTTCTTCT CTACCATCAC GGCGTGGCCT TCCCCCGTAA CATCCTGCAT 
CGAAAGAAGA GATGGTAGTG CCGCACCGGA AGGGGGCATT GTAGGACGTA 

TACTACCGTC ATCTCTACAG CCCATACTGC ACCGGCGGCA GCGGCAGCAA 
ATGATGGCAG TAGAGATGTC 6GGTATGACG T6GCCGCCGT CGCCGTCGTT 

CAGCAGCGGC CACACAGAAG CAAAGGCGAC CCGATAGCAA GACTCTGACA 
GTCGTCGCCG GTGTGTCTTC GTTTCCGCTG GCCTATCGTT CTGAGACTGT 

AAGCCCAAGA AATCCACA6C 6GCGGCAGCA 6CAGGAGGAG GAGC6CTGCG 
TTCGGGTTCT TTAGGTGTCG CCGCCGTCGT CGTCCTCCTC CTCGCGACGC 

TCTGGCGCCC AACGAACCC6 TATCGACCCG CGAGCTTA6A AACAGGATTT 
AGACCGCGGG TTGCTTGGGC ATAGCTGGGC GCTCGAATCT TTGTCCTAAA 

TTCCCACTCT GTATGCTATA TTTCAACAGA GCAGGGGCCA AGAACAAGAG 
AAGGGTGAGA CATACGATAT AAAGTTGTCT CGTCCCCGGT TCTTGTTCTC 
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27501 CTGAAAATAA ;^p»CAGGTC TCTGCGATCC CTCACCCGCA GCTGCQ^fk 
GACTTTTATT l^TGTCCAG AGACGCTAGG GAGTGGGCGT CGACGgIRt 

27551 TCACAAAAGC GAAGATCAGC TTCGGCGCAC GCTGGAAGAC GCGGA66CTC 
AGTGTTTTC6 CTTCTAGTCG AAGCCGCGTG C6ACCTTCTG CGCCTCCGAG 

27601 TCTTCAGTAA ATACTGCGCG' CTGACTCTTA AGGACTAGTT TCGCGCCCTT 
AGAAGTCATT TATGACGCGC GACTGAGAAT TCCTGATCAA AGCGC6QGAA 

27651 TCTCAAATTT AAGCGCGAAA ACTACGTCAT CTCCAGCGGC CACACCCGGC 
AGAGTTTAAA TTC6CGCTTT TGATGCAGTA GAGGTCGCCG GTGTG6GCCG 



27701 GCCAGCACCT GTTGTCA6CG CCATTATGAG CAAGGAAATT CCCACGCCCT 
CGGTCGTGGA CAACAGTCGC GGTAATACTC 6TTCCTTTAA GGGTGCGGGA 

27751 ACATGTGGA6 TTACCAGCCA CAAATGGGAC TTGCGGCTGG AGCTGCCCAA 
TGTACACCTC AATGGTCGGT GTTTACCCTG AACGCCGACC TCGACGGGTT 

27B01 GACTACTCAA CCCGAATAAA CTACATGAGC GCGGGACCCC ACATGATATC 
CTGATGAGTT GGGCTTATTT GATGTACTCG CGCCCTGGGG TGTACTATAG 

27851 CCGGGTCAAC GGAATACGCG CCCACCGAAA CCGAATTCTC CTGGAACAGG 
GGCCCAGTTG CCTTATGCGC GGGTGGCTTT GGCTTAAGAG GACCTTGTCC 

27901 CGGCTATTAC CACCACACCT CGTAATAACC TTAATCCCCG TAGTTGGCCC 
GCCGATAATG GTGGTGTGGA GCATTATT6G AATTAGGGGC ATCAACCGGG 

27951 GCTGCCCTGG TGTACCAG6A AAGTCCCGCT CCCACCACTG TGGTACTTCC 
CGACGGGACC ACATGGTCCT TTCAGGGCGA GGGTGGTGAC ACCATGAAGG 

28001 CAGAGACGCC CAGGCCGAAG TTCAGATGAC TAACTCAGGG GCGCAGCTTG 
GTCTCTGCGG GTCCGGCTTC AAGTCTACTG ATTGAGTCCC CGCGTCGAAC 

28051 CGGGCGGCTT TCGTCACAGG GTGCGGTCGC CCGGGCAGGG TATAACTCAC 
GCCCGCCGAA AGCAGTGTCC CACGCCAGCG GGCCCGTCCC ATATTGAGTG 

28101 CTGACAATCA GAGGGCGAGG TATTCAGCTC AACGACGAGT CGGTGAGCTC 
GACTGTTAGT CTCCCGCTCC ATAAGTCGAG TTGCTGCTCA GCCACTCGAG 

28151 CTCGCTTGGT CTCCGTCCGG ACGGGACATT TCAGATCGGC GGCGCCGGCC 
GAGCGAACCA GAGGCAGGCC T6CCCTGTAA AGTCTAGCCG CCGCGGCCGG 

28201 . GCTCTTCATT CACGCCTCGT CAG6CAATCC TAACTCTGCA GACCTCGTCC 
CGAGAAGTAA 6TGCGGAGCA GTCCG7TAGG ATTGAGACGT CTGGAGCAGG 

28251 TCTGAGCCGC GCTCTGGAGG CATTGGAACT CTGCAATTTA TTGAGGAGTT 
AGACTCGGCG CGAGACCTCC GTAACCTTGA GACGTTAAAT AACTCCTCAA 

28301 TGTGCCATCG GTCTACTTTA ACCCCTTCTC GGGACCTCCC GGCCACTATC 
ACACGGTAGC CA6ATGAAAT TGGGGAAGAG CCCTGGAGGG CCGGTGATA6 

28351 CGGATCAATT TATTCCTAAC TTTGACGCGG TAAAGGACTC GGCG6ACGGC • 
GCCTAGTTAA ATAA6GATTG AAACTGCGCC ATTTCCTGAG CCGCCTGCC6 

28401 TACGACTQAA TGTTAAGTGG AGAGGCAGAG CAACTGCGCC TGAAACACCT 
ATGCTGACTT ACAATTCACC TCTCCGTCTC GTTGACGCGG ACTTTGTGGA 
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28451 GGTCCACTGT cliftGCCACA AGTGCTTTGC CCGCGACTCC GGTGAdW^T 
CCAGGTGACA GCGGCGGTGT TCACGAAACG GGCGCTGAGG CCACTCAAAA 

28501 GCTACTTTGA ATTGCCCGAG GATCATATCG AGGGCCCGGC GCACGGCGTC 
CGATGAAACT TAACGGGCTC CTAGTATAGC TCCCGGGCCG CGTGCCGCAG 

28551 CGGCTTACCG CCCAGGGAGA GCTTGCCCGT AGCCTGATTC GGGAGTTTAC 
GCCGAATGGC OGGTCCCTCT CGAACG6GCA TCG6ACTAAG CCCTCAAATG 

28601 CCAGCGCCCC CTGCTAGTTG AGCGGGACAG GGGACCCTGT GTTCTCACTG 
GGTCGCGGGG GACGATCAAC TCGCCCTGOC CCCTGGGACA CAAGAGTGAC 



28651 TGATTTGCAA CTGTCCTAAC CCTGGATTAC ATCAAGATCT TTGTTGCCAT 
ACTAAACGTT GACAGGATTG GGACCTAATG TAGTTCTAGA AACAACGGTA 

2 87 01 CTCTGTGCTG AGTATAATAA ATACAGAAAT TAAAATATAC TGGGGCTCCT 
GAGACACGAC TCATATTATT TATGTCTTTA ATTTTATATG ACCCCGAGGA 

28751 ATCGCCATCC TGTAAACGCC ACCGTCTTCA CCCGCCCAAG CAAACCAAGG 
TAGCGGTAGG ACATTTGCGG TGQCAGAAGT GGGCGGGTTC GTTTGGTTCC 

28801 CGAACCTTAC CTGGTACTTT TAACATCTCT CCCTCTGTGA TTTACAACAG 
GCTTGGAATG GACCATGAAA ATT6TAGAGA GGGAGACACT AAATGTTGTC 

28851 TTTCAACCCA GACGGAGTGA GTCTACGAGA GAACCTCTCC GAGCTCAGCT 
AAAGTTGGGT CTGCCTCACT CAGATGCTCT CTTGGAGAGG CTCGAGTCGA 

28901 ACTCCATCAG AAAAAACACC ACCCTCCTTA CCTGCCGGGA ACGTACGAGT 
TGAGGTAGTC TTTTTTGTGG TGGGAGGAAT GGACGGCCCT TGCATGCTCA 

28951 GCGTCACCGG CCGCTGCACC ACACCTACCG CCTGACCGTA AACCAGACTT 
CGCAGTGGCC GGCGACGTGG TGTGGATGGC GGACTGGCAT TTGGTCTGAA 

29001 TTTCCGGACA GACCTCAATA ACTCTGTTTA CCAGAACAGG AGGTGAGCTT 
AAAGGCCTGT CTGGAGTTAT TGAGACAAAT GGTCTTGTCC TCCACTCGAA 

29051 AGAAAACCCT TAGGGTATTA GGCCAAAGGC GCAGCTACTG TGGGGTTTAT 
TCTTTTGGGA ATCCCATAAT CCGGTTTCCG CGTCGATGAC ACCCCAAATA 

29101 GAACAATTCA AGCAACTCTA CGGGCTATTC TAATTCAGGT TTCTCTAGAA 
CTTGTTAAGT TCGTTGAGAT GCCCGATAAG ATTAAGTCCA AAGAGATCTT 

29151 TCGGGGTTGG GGTTATTCTC TGTCTTGTGA TTCTCTTTAT TCTTATACTA 
AGCCCCAACC CCAATAAGAG ACAGAACACT AAGAGAAATA AGAATATGAT 

29201 ACGCTTCTCT GCCTAAGGCT CGCCGCCTGC TGTGTGCACA TTTGCATTTA 
TGCGAAGAGA CGGATTCCGA GCGGCGGACG ACACACGTGT AAAC6TAAAT 

29251 TTGTCAGCTT TTTAAACGCT GGGGTCGCCA CCCAAGATGA TTAG6TACAT 
AACAGTCGAA AAATTTGCGA CCCCAGCGGT GGGTTCTACT AATCCAT6TA 

29301 AATCCTAGGT TTACTCACCC TTGCGTCAGC CCACGGTACC ACCCAAAAGG 
TTAGGATCCA AAT6AGTGGG AACGCAGTCG GGTGCCATGG TGGGTTTTCC 

29351 TCGATTTTAA GGAGCCAGCC TGTAATGTTA CATTCGCAGC TGAAGCTAAT 
ACCTATiAATT CCTCGGTCGG ACATTACAAT GTAAGCGTCG ACTTCGATTA 
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29401 GAGTOCACCA C^TATAAA ATGCACCACA GAACATCAAA ACiC-J*i:K^pT 

CTCACGTGGT GAGAATATTT TACGTGGTGT CTTGTACTTT TCQACGAATA 

29451 TCGCCACAAA AACAAAATT6 GCAAGTATGC TGTTTATGCT ATTTGGCAGC 
AGCGGTGTTT TTGTTTTAAC CGTTCATAC6 ACAAATACGA TAAACCGTCG 

29501 CAGGTGACAC TACAGAGTAT AATGTTACAG TTTTCCAGGG TAAAAGTCAT 
GTCCACTGTG AT6TCTCATA TTACAATGTC AAAAGGTCCC ATTTTCAGTA 

29551 AAAACTTTTA TGTATACTTT TCCATTTTAT GAAATGTGCG ACATTACCAT 
TTTTGAAAAT ACATATGAAA AGGTAAAATA CTTTACACGC TGTAATGGTA 



29601 GTACATGAGC AAACAGTATA AGTTGTG6CC CCCACAAAAT TGTGTGGAAA 
CATGTACTCG TTTGTCATAT TCAACACCGG GGGTCTTTTA ACACACCTTT 

29651 ACACTGGCAC TTTCTGCTGC ACT6CTATGC TAATTACAGT GCTCGCTTTG 
TGTGACCGTG AAAGACGACG TGACGATACG ATTAATGTCA CGAGCGAAAC 

29701 GTCTGTACCC TACTCTATAT TAAATACAAA AGCAGACGCA GCTTTATTGA 
CAGACATGGG ATGAGATATA ATTTATGTTT TCGTCTGCGT CGAAATAACT 

29751 GGAAAAGAAA ATGCCTTAAT TTACTAAGTT ACAAAGCTAA TGTCACCACT 
CCTTTTCTTT TACGGAATTA AATGATTCAA TGTTTCGATT ACAGTGGTGA 

29801 AACTGCTTTA CTCGCTCCTT GCAAAACAAA TTCAAAAAGT TAGCATTATA 
TTGACGAAAT GAGCGACGAA CGTTTT6TTT AAGTTTTTCA ATCGTAATAT 

29851 ATTAGAATAG GATTTAAACC CCCCGGTCAT TTCCTGCTCA ATACCATTCC 
TAATCTTATC CTAAATTTGG GG6GCCAGTA AAGGACGAGT TATGGTAAGG 

29901 CCTGAACAAT TGACTCTATG TGGGATATGC TCCAGCGCTA CAACCTTGAA 
GGACTTGTTA ACTGAGATAC ACCCTATACG AGGTCGCGAT GTTGGAACTT 

29951 GTCAGGCTTC CTGGATGTCA GCATCTGACT TTGGCCAGCA CCTGTCCCGC 
CAGTCCGAAG GACCTACAGT CGTAGACTGA AACCGGTCGT GGACAGGGCG 

30001 GGATTTGTTC CAGTCCAACT ACAGCGACCC ACCCTAACAG AGATGACCAA 
CCTAAACAAG GTCAGGTTGA TGTCGCTGGG TGG6ATTGTC TCTACTGGTT 

30051 CACAACCAAC GCGGCCGCCG CTACCGGACT TACATCTACC ACAAATACAC 
GTGTTGGTTG CGCCGGCGGC GATGGCCTGA ATGTAGATGG TGTTTATGTG 

30101 CCCAAGTTTC TGCCTTTGTC AATAACTGGG ATAACTTGGG CATGTGGTGG 
GG6TTCAAAG ACGGAAACAG TTATTGACCC TATTCAACCC GTACACCACC 

30151 TTCTCCATAG CGCTTATGTT TGTATGCCTT ATTATTATGr GGCTCATCTG 
AAGAGGTATC GCGAATACAA ACATACGGAA TAATAATACA CCGAGTAGAC 

30201 CTGCCTAAAG CGCAAACGCG CCCGACCACC CATCTATAGT CCCATCATT6 
GACGGATTTC GC6TTTGCGC GGGCTGGT6G GTAGATATCA 6G6TAGTAAC 

30251 TGCTACACCC AAACAAT6AT GGAATCCATA GATTGGACGG ACTGAAACAC 
ACGAT6TGGG TTTGTTACTA CCTTAGGTAT CTAACCTGCC TGACTTTGTG 

30301 ATGTTCTTTT CTCTTACAGT ATGATTAAAT GAGACATGAT TCCTCGAGTT 
TACAAGAAAA GAGAATGTCA TACTAATTTA CTCTGTACTA AGGAGCTCAA 
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30351 TTTATATTAC iBpCCTTGT TGCGCTTTTT TGTGCGTGCT CCACAT^pc - 
AAATATAATG ACTGGGAACA ACGCGAAAAA ACACGCACGA GGTGTA^G 

30401 TGCGGTTTCT CACATC6AAG TAGACTGCAT TCCAGCCTTC ACAGTCTATT 
ACGCCAAAGA GTGTAGCTTC ATCTGACGTA AG6TCGGAA6 TGTCAGATAA 

30451 TGCTTTACGG ATTTGTCACC CTCACGCTCA TCTGCAGCCT CATCACTGTG 
ACGAAATGCC TAAACAGT6G GAGTGCGA6T AGACGTCGGA GTAGTGACAC 

30501 GTCATCGCCT TTATCCAGTG CATTGACTGG GTCTGTGTGC GCTTTGCATA 
CAGTAGCGGA AATAGGTCAC GTAACTGACC CAGACACACG CGAAACGTAT 



30551 TCTCAGACAC CATCCCCAGT ACAGGGACAG GACTATAGCT GAGCTTCTTA 
AGAGTCTGTG GTAGGGGTCA TGTCCCTGTC CTGATATCGA CTCGAAGAAT 

30601 GAATTCTTTA ATTATQAAAT TTACTGTGAC TTTTCTGCTG ATTATTTGCA 
CTTAAGAAAT TAATACTTTA AATGACACTG AAAAGACGAC TAATAAACGT 

30651 CCCTATCTGC GTTTTGTTCC CCGACCTCCA AGCCTCAAA6 ACATATATCA 
GGGATAGACG CAAAACAAGG GGCTGGAGGT TCGGAGTTTC TGTATATAGT 

30701 TGCAGATTCA CTCGTATATG GAATATTCCA AGTTGCTACA ATGAAAAAAG 
ACGTCTAAGT GAGCATATAC CTTATAACGT TCAACGATGT TACTTTTTTC 

30751 CGATCTTTCC CAAGCCTGGT TATATGCAAT CATCTCTGTT ATGGTGTTCT 
GCTAGAAAGG CTTCGGACCA ATATACGTTA GTAGAGACAA TACCACAAGA 

30801 GCAGTACCAT CTTAGCCCTA GCTATATATC CCTACCTTGA CATTGGCTGG 
CGTCATGGTA GAATCGGGAT CGATATATAG GGATGGAACT GTAACCGACC 

30B51 AACGCAATAG ATGCCATGAA CCACCCAACT TTCCCCGCGC CCGCTATGCT 
TTGCGTTATC TACGGTACTT GGTGGGTTGA AAGGGGCGCG GGCGATACGA 

30901 TCCACTGCAA CAAGTTGTTG CCG6CGGCTT TGTCCCAGCC AATCAGCCTC 
AGGTGACGTT GTTCAACAAC GGCCGCCGAA ACAGGGTCGG TTAGTCGGAG 

30951 GCCCACCTTC TCCCACCCCC ACTGAAATCA GCTACTTTAA TCTAACAGGA 
CGGGTGGAAG AGGGTGGGGG TGACTTTAGT CGATGAAATT AGATTGTCCT 

31001 GGAGATGACT GACACCCTAG ATCTAGAAAT OGACGGAATT ATTACAGAGC 
CCTCTACTGA CTGTGGGATC TAGATCTTTA CCTGCCTTAA TAATGTCTCG 

31051 AGCGCCTGCT AGAAAGACGC AGGGCAGCGG CCGAGCAACA GCGCATGAAT 
TCGCGGACGA TCTTTCTGCG TCCCGTCGCC GGCTCGTTGT CGCOTACTTA 

31101 CAAGAGCTCC AAGACATGGT TAACTTGCAC CAGTGCAAAA GGGGTATCTT 
GTTCTCGAGG TTCTGTACCA ATTGTIACGTG GTCACGTOTT CCCCATAGAA 

31151 TTGTCTCGTA AAGCAGGCCA AAGTCACCTA CGACAGTAAT ACCACCGGAC 
AACAGAGCAT TTCGTCCG6T TTCAGTGGAT GCT6TCATTA TGGTGQCCTG 

31201 ACCGCCTTAG CTACAAGTTG CCAACCAAGC GTCAGAAATT GGTGGTCATG 
TGGCGGAATC GATGTTCAAC GGTTGGTTCG CAGTCTTTAA CCACCAGTAC 

31251 GTGGGAGAAA AGCCCATTAC CATAACTCAG CACTCG6TAG AAACCGAAGG 
CACCCTCTTT TCGGGTAATG GTATTGAGTC GTGAGCCATC TTTGGCTTCC 
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31301 . CTGCATTCAC" T^TCTTGTC AAGGACCTGA GGATCTCTGC ACCCTTj^'A 
GACGTAAGTG ACTGGAACAG TTCCTGGACT CCTAGAGACG TGGGAJ^Rt 

31351 AGACCCTGTG CGGTCTCAAA GATCTTATTC CCTTTAACTA ATAAAAAAAA 
TCTGGGACAC GCCAGAGTTT CTAGAATAAG GGAAATTGAT TATTTTTTTT 

31401 ATAATAAAGC ATCACTTACT TAAAATCAGT TAGCAAATTT CTGTCCAGTT 
TATTATTTCG TAGTGAATGA ATTTTAGTCA ATCGTTTAAA GACAGGTCAA 

31451 TATTCAGCAG CACCTCCTTG CCCTCCTCCC AGCTCTGGTA TTGCAGCTTC 
ATAAGTCGTC GTGGAGGAAC GGGAGGAGGG TCGAGACCAT AACGTCGAAG 



31501 CTCCTGGCTG CAAACTTTCT CCACAATCTA AATGGAATGT CAGTTTCCTC 
GAGGACCGAC GTTTGAAAGA GGTGTTAGAT TTACCTTACA GTCAAAGGAG 

31551 CTGTTCCTGT CCATCCGCAC CCACTATCTT CATGTTGTTG CAGATGAAGC 
GACAAGGACA GGTAGGCGTG GGTGATAQAA GTACAACAAC GTCTACTTCG 

31601 GCGCAAGACC GTCTGAAGAT ACCTTCAACC CC6TGTATCC ATATGACACG 
C6CGTTCT6G CAGACTTCTA T6GAAGTTGG GGCACATAGG TATACTGTGC 

31651 GAAACCGGTC CTCCAACTGT GCCTTTTCTT ACTCCTCCCT TTGTATCCCC 
CTTTGGCCAG GAGGTTGACA CGGAAAAGAA TGAGGAGGGA AACATAGGGG 

31701 CAATCGGTTT CAAGAGAGTC CCCCTGQGGT ACTCTCTTTG CGCCTATCCG 
GTTACCCAAA GTTCTCTCAG GGGGACCCCA TGAGAGAAAC GCGGATAGGC 

31751 AACCTCTAGT TACCTCCAAT GGCATGCTTG CGCTCAAAAT GGGCAACGGC 
TT6GAGATCA ATGGAGGTTA CCGTACGAAC GCGAGTTTTA CCCGTTGCCG 

•JlfiOl CTCTCTCTGG ACGAGGCCGG CAACCTTACC TCCCAAAATG TAACCACTGT 
GAGAGAGACC TGCTCCGGCC GTTGGAATGG AGGGTTTTAC ATTGGTGACA 

31851 GAGCCCACCT CTCAAAAAAA CCAAGTCAAA CATAAACCTG GAAATATCT6 
CTCGGGTGGA GAGTTTTTTT GGTTCAGTTT GTATTTGGAC CTTTATAGAC 

31901 CACCCCTCAC AGTTACCTCA GAAGCCCTAA CTGTGGCTGC CGCCGCACCT 
GTGGGGAGTG TCAATGGAGT CTTCGGGATT GACACCGACG GCGGCGTGGA 

31951 CTAATGGTCG CGGGCAACAC ACTCACCATG CAATCACA6G CCCCGCTAAC 
GATTACCAGC GCCCGTTGTG TGAGTGGTAC GTTAGTGTCC GGGGCGATTG 

32001 CGTGCACGAC TCCAAACTTA GCATTGCCAC CCAAGGACCC CTCACA6TGT 
GCACGTGCTG AGGTTTGAAT C6TAACGGTG 6GTTCCTGGG GAGTGTCACA 

32051 CAGAAGGAAA QCTAGCCCTG CAAACATCAG GCCCCCTCAC CACCACCGAT 
GTCTTCCTTT CGATCGGGAC GTTTGTAGTC CGGGGGAGTG GTGGTGGCTA 

32101 AGCAGTACCC TTACTATCAC TGCCTCACCC CCTCTAACTA CTGCCACTGG 
TCGTCAT6GG AATGATAGTG AC6GAGTGG6 GGAGATTGAT GACGGTGACC 

32151 TAGCTTGGGC ATTGACTTGA AAGAGCCCAT TTATACACAA AATGGAAAAC 
ATCGAACCCG TAACTGAACT TTCTCGGGTA AATATGTGTT TTACCTTTTG 

32201 TAGGACTAAA GTACGGG6CT CCTTTGCATG TAACAGACGA CCTAAACACT 
ATCCTGATTT CATGCCCCGA GGAAAC6TAC ATTGTCTGCT 6GATTTGTGA 
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32251 TTGACCGTAG MMTTGGTCC AGGTGTK3ACT ATTAATAJt^fXHrTPOtrCllltA 
AACTGGCATC HPgACCAGG TCCACACTGA TAATTATTAT GAAGgI^ST 

32301 AACTAAAGTT ACTGGAGCCT tgggttttga ttcacaaggc aatatgcaac 
ttgatttcaa tgacctcgga acccaaaact aagtgttccg ttatacgttg 

32351 ttaatgtagc aggaggacta aggattgatt ctcaaaacag acgccttata 
aattacatcg tcctcctgat tcctaactaa gagttttgtc tgcggaatat 

32401 cttgatgtta gttatccgtt tgatgctcaa aaccaactaa atctaagact 

6AACTACAAT CAATAGGCAA ACTACGAGTT TTGGTTGATT TAGATTCTGA 



32451 AGGACAGGGC CCTCTTTTTA TAAACTCAGC CC/iCAACTTG GATATTAACT 
TCCTGTCCCG GGAGAAAAAT ATTTGAGTCG GGTGTTGAAC CTATAATTGA 

32501 ACAACAAAGG CCTTTACTTG TTTACAGCTT CAAACAATTC CAAAAAGCTT 
TGTTGTTTCC GGAAATGAAC AAATGTCGAA GTTTGTTAAG GTTTTTCGAA 

32551 GAGGTTAACC TAAGCACTGC CAAGGGGTTG ATGTTTGACG CTACAGCCAT 
CTCCAATTGG ATTCGTGACG GTTCCCCAAC TACAAACTGC GATGTCGGTA 

32601 AGCCATTAAT GCAGGAGATG GGCTTGAATT TGGTTCACCT AATGCACCAA 
TCGGTAATTA CGTCCTCTAC CCGAACTTAA ACCAAGTGGA TTACGTGGTT 

32651 ACACAAATCC CCTCAAAACA AAAATTGGCC AT6GCCTA6A ATTTGATTCA 
TGTGTTTAGG GGAGTTTTGT TTTTAACCGG TACCGGATCT TAAACTAAGT 

32701 AACAAGGCTA TGGTTCCTAA ACTAGGAACT 66CCTTAGTT TTGACAGCAC 
TTGTTCCGAT ACCAAGGATT TGATCCTTGA CCGGAATCAA AACTGTCGTG 

32751 AGGTGCCATT ACAGTAGGAA ACAAAAATAA TGATAAGCTA ACTTTGTGGA 
TCCACGGTAA TGTCATCCTT TGTTTTTATT ACTATTCGAT TGAAACACCT 

32B01 CCACACCAGC TCCATCTCCT AACTGTAGAC TAAATGCAGA GAAAGATGCT 
GGTGTGGTCG AGGTAGAGGA TTGACATCTG ATTTACGTCT CTTTCTACGA 

32B51 AAACTCACTT TGGTCTTAAC AAAATGTGGC AGTCAAATAC TTGCTACAGT 
TTTGAGTGAA ACCAGAATTG TTTTACACCG TCAGTTTATG AACGATGTCA 

32901 TTCAGTTTTG GCTGTTAAAQ GCAGTTTGGC TCCAATATCT GGAACAGTTC 
AAGTCAAAAC CGACAATTTC CGTCAAACCG AGGTTATAGA CCTTGTCAAG 

32951 AAAGTGCTCA TCTTATTATA AGATTTGACG AAAATGGAGT GCTACTAAAC 
TTTCACGAGT AGAATAATAT TCTAAACTGC TTTTACCTCA CGATGATTTG 

33001 AATTCCTTCC TGGACCCAGA ATATTGGAAC TTTAGAAATG GAGATCTTAC 
TTAAGGAAGG ACCTGCGTCT TATAACCTTG AAATCTTTAC CTCTAGAATG 

33051 TGAAGGCACA GCCTATACAA ACGCTGTTGG ATTTATGCCT AACCTATCAG 
ACTTCCGTGT CGGATATGTT TGCGACAACC TAAATACGGA TTGGATAGTC 

33101 CTTATCCAAA ATCTCAC6GT AAAACTGCCA AAAGTAACAT TGTCAGTCAA 
GAATAGGTTT TAGAGTGCCA TTTTGACGGT TTTCATTGTA ACA6TCAGTT 

33151 GTTTACTTAA ACGGAGACAA AACTAAACCT GTAACACTAA CCATTACACT 
CAAATGAATT TGCCTCTGTT TTGATTTGGA CATTGTGATT GGTAATGTGA 
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33201 AAACGGTACA C^pAAACAG GAGACACAAC TCCAAGTGlfcA TACTC^Kt 
TTTGCCATGT GW:TTTGTC CTCTGTGTTG AGGTTCACGT ATGAGmIcA 

33251 CATTTTCATG GGACTG6TCT GGCCACAACT ACATTAATGA AArATTTGCC 
GTAAAAGTAC CCTGACCAGA CCGGTGTTGA T6TAATTACT TTATAAACGG 

33301 ACATCCTCTT ACACTTTTTC ATACATTGCC CAAGAATAAA GAATCGTTTG 
TGTA6GAGAA TGTGAAAAA6 TATGTAACGG GTTCTTATTO CTTAGCAAAC 

33351 TGTTATGTTT CAACGTGTTT ATTTTTCAAT TGCAGAAAAT TTCAACTCAT 
ACAATACAAA GTTGCACAAA TAAAAAGTTA ACGTCTTTTA AAGTTCAGTA 



33401 TTTTCATTCA GTAGTATAGC CCCACCACCA CATAGCTTAT ACAGATCACC 
AAAAGTAAGT CATCATATCG GGGTGGT6GT GTATCGAATA TGTCTAGTG6 

33451 GTACCTTAAT CAAACTCACA GAACCCTAGT ATTCAACCTG CCACCTCCCT 
CATGGAATTA GTTTGAGTGT CTTGGGATCA TAAGTTGGAC GGTGGAGGGA 

33501 CCCAACACAC AGAGTACACA GTCCTTTCTC CCCGGCTGGC CTTAAAAAGC 
GG6TTGTGTG TCTCATGTGT CAGGAAAGAG GGGCCGACCG GAATTTTTCG 

33551 ATCATATCAT GGGTAACAGA CATATTCTTA GGTGTTATAT TCCACACGGT 
TAGTATAGTA CCCATTQTCT GTATAAGAAT CCACAATATA AGGTGTGCCA 

33601 TTCCTGTCGA GCCAAACGCT CATCAGTGAT ATTAATAAAC TCCCCGGGCA 
AAGGACAGCT CGGTTTGCGA GTAGTCACTA TAATTATTTG AGGGGCCCGT 

33651 GCTCACTTAA GTTCATGTCG CTGTCCAGCT GCTGAGCCAC AGGCTGCTGT 
CGAGTGAATT CAAGTACAGC GACAGGTCGA CGACTCGGTG TCCGACGACA 

33701 CCAACTTGCG GTTGCTTAAC GGGCGGCGAA GGAGAAGTCC ACGCCTACAT 
GGTTGAACGC CAACGAATTG CCCGCCGCTT CCTCTTCAGG TGCGGATGTA 

33751 GGGGGTAGAG TCATAATCGT GCATCAGGAT AGGGCGGTGG TGCTGCAGCA 
CCCCCATCTC AGTATTAGCA CGTAGTCCTA TCCCGCCACC ACGACGTCGT 

33801 GCGCGCGAAT AAACTGCTGC CGCCGCCGCT CCGTCCTGCA GGAATACAAC 
CGCGCGCTTA TTTGACGACG GCGGCGGCGA GGCAGGACGT CCTTATGTTG 

33851 ATGGCAGTGG TCTCCTCAGC GATGATTCGC ACCGCCCGCA GCATAAGGCG 
TACCGTCACC AGAGGAGTC6 CTACTAAGC6 TGGCGGGCGT CGTATTCCGC 

33901 CCTTGTCCTC CGGGCACA6C AGCGCACCCT GATCTCACTT AAATCAGCAC 
GGAACAGGAG GCCC6T6TCG TC6CGTGGGA CTA6AGTGAA TTTAGTC6TG 

33951 AGTAACTGCA GCACAGCACC ACAATATTGT TCAAAATCCC ACAGTGCAAG 
TCATTGACGT CGTGTCGTGG TGTTATAACA AGTTTTAGGG TGTCACGTTC 

34001 GCGCTGTATC CAAAGCTCAT GGCGGGGACC ACAGAACCCA CGTGGCCATC 
CGCGACATAG GTTTCGAGTA CCGCCCCTG6 TGTCTTGGGT GCACCGGTAG 

34051 ATACCACAAG CGCAGGTAGA TTAAGTGGCG ACCCCTCATA AACACGCTGG 
TATGGTGTTC GCGTCCATCT AATTCACCGC TGGGGAGTAT TTGTGCGACC 

34101 ACATAAACAT TACCTCTTTT GGCATGTTGT AATTCACCAC CTCCCGGTAC 
TGTATTTGTA ATGGAGAAAA CCCTACAACA TTAAGTOGTG GAGGGCCATG 
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34151 CATATAAACC M^ATTAAA CATGGCGCCA TCCACCACCA-TCCTAJQ|||A ' 
GTATATTTGG SIStAATTT GTACCGCGGT AGGTGGTGGT AGGATO^Rt 

34201 GCTGGCCAAA ACCTGCCCGC CGGCTATACA CTGCAGGGAA CCGGGACTGG 
CGACCGGTTT TGGACGGGCG GCCGATATGT gacgtccctt ogccctgacc 

34251 aacaatgaca gtggagagcc caggactcgt aaccatggat catcatgctc 
ttgttactgt cacctctcgg gtcctgagca ttggtaccta gtagtacgag 

34301 gtcatgatat caatgttggc acaacacagg cacacgtgca tacacttcct 

CAGTACTATA GTTACAACCG TGTTGTGTCC GTGTGCACGT ATGTGAAGGA 



34351 CAGGATTACA AGCTCCTCCC GCGTTAGAAC CATATCCCAG GGAACAACCC 
GTCCTAATGT TCGAGGAGGG CGCAATCTTG GTATAGGGTC CCTTGTTGGG 

34401 ATTCCTGAAT CAGCGTAAAT CCCACACTGC AGGGAAGACC TCGCACGTAA 
TAAGGACTTA 6TCGCATTTA GGGTGTGACG TCCCTTCTGG A6CGTGCATT 

34451 CTCACGTTGT GCATTGTCAA AGTGTTACAT TCGGGCAGCA GCGGATGATC 
GAGTGCAACA CGTAACAGTT TCACAATGTA AGCCCGTCGT CGCCTACTAG 

34501 CTCCAGTATG GTAGCGCGGG TTTCTGTCTC AAAAGGAGGT AGACGATCCC 
GAGGTCATAC CATCGCGCCC AAAGACAGAG TTTTCCTCCA TCTGCTAGGG 

34551 TACTGTACGG AGTGCGCCGA GACAACCGAG ATCGTGTTGG TCGTAGTGTC 
AT6ACAT6CC TCACGCGGCT CTGTTGGCTC TAGCACAACC AGCATCACAG 

34601 ATGCCAAATG GAACGCCGGA CGTAGTCATA TTTCCTGAAG CAAAACCAGG 
TACGGTTTAC CTTGCGGCCT GCATCAGTAT AAAGGACTTC GTTTTGGTCC 

34651 TGCGGGCGTG ACAAACAGAT CTGCGTCTCC GGTCTCGCCG CTTAGATCGC 
ACGCCCGCAC TGTTTGTCTA GACGCAGAGG CCAGAGCGGC GAATCTAGCG 

34701 TCTGTGTAGT AGTTGTAGTA TATCCACTCT CTCAAAGCAT CCAGGCGCCC 
AGACACATCA TCAACATCAT ATAGGTGAGA GAGTTTCGTA GGTCCGCGGG 

34751 CCTGGCTTCG 6GTTCTATGT AAACTCCTTC ATGCGCCGCT GCCCTGATAA 
GGACCGAAGC CCAAGATACA TTTGAGGAAG TACGC6GCGA CGGGACTATT 

34801 CATCCACCAC CGCAGAATAA GCCACACCCA GCCAACCTAC ACATTCGTTC 
GTAGGTGGTG GCGTCTTATT CGGTGTGGGT CGGTTGGATG TGTAAGCAAG 

34B51 TGCGAGTCAC ACACGGGAGG AGCGGGAAGA GCTGGAAGAA CCATGTTTTT 
ACGCTCAGTG TGTGCCCTCC TCGCCCTTCT CGACCTTCTT GGTACAAAAA 

34901 TTTTTTATTC CAAAAGATTA TCCAAAACCT CAAAATGAAG ATCTATTAAG 
AAAAAATAAG CTTTTCTAAT AGGTTTTG6A GTTTTACTTC TAGATAATTC 

34951 TCAACGCGCT CCCCTCCGGT GGCGTGGTCA AACTCTACAG CCAAAGAACA 
ACTTGCGCGA GGGGAG6CCA CCGCACCAGT TTGAGATGTC GGTTTCTTGT 

35001 GATAATGGCA TTTGTAAGAT GTTGCACAAT GGCTTCCAAA AGGCAAACGG 
CTATTACCGT AAACATTCTA CAACGTGTTA CCGAAGGTTT TCCGTTTGCC 

35051 CCCTCACGTC CAAGTGGACG TAAAGGCTAA ACCCTTCAGG GTGAATCTCC 
GGGAGTGCAG GTTCACCTGC ATTTCCGATT TGGGAAGTCC CACTTAGAGG 
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35101 TCTATAAACA TjJfcAGCACC TTCAACCATG CCCAAATAAT TCTCAIJl^G 
AGATATTTGT JBgTCGTGG AAGTTGGTAC GGGTTTATTA AGAGTAgRc 

35151 CCACCTTCTC AATATATCTC TAAGCAAATC CCQAATATTA AGTCCGGCCA 
GGTGGAAGAG TTATATAGAG ATTCGTTTAG GGCTTATAAT TCAGGCCGGT 

35201 TOSTAAAAAT ctgctccaga gcgccctcca ccttcagcct caagcagcga 

AACATTTTTA GACGAGGTCT CGCGGGAGGT GGAAGTCGGA GTTCGTCGCT 

35251 ATCATGATTG CAAAAATTCA GGTTCCTCAC AGACCTGTAT AAGATTCAAA 
TAGTACTAAC GTTTTTAAGT CCAAGGAGTG TCTGGACATA TTCTAAGTTT 



35301 AGCGGAACAT TAACAAAAAT ACCGCGATCC CGTAGGTCCC TTCGCAGGGC 
TCGCCTTGTA ATTGTTTTTA TGGCGCTAGG GCATCCAGGG AAGCGTCCCG 

35351 CAGCTCAACA TAATCGTGCA GGTCTQCACG GACCAGCGCG GCCACTTCCC 

gtcgacttgt attagcacgt ccagacgtgc ctggtcgcgc cggtgaaggg 

35401 cgccaggaac catgacaaaa gaacccacac tgattatgac acgcatactc 
gcggtccttg gtactgtttt cttgggtgtg actaatactg tgcgtatgag 

35451 ggagctatgc taaccagcgt agccccgatg taagcttgtt gcatgggcgg 

CCTCGATACG ATTGGTCGCA TCGGGGCTAC ATTCGAACAA CGTACCCGCC 

35501 CGATATAAAA TGCAAGGTGC TGCTCAAAAA ATCAGGCAAA GCCTCGCGCA 
GCTATATTTT ACGTTCCACG ACGAGTTTTT TAGTCCGTTT CGGAGCGCGT 

35551 AAAAAGAAAG CACATCGTAG TCATGCTCAT GCAGATAAAG GCAGGTAAGC 
TTTTTCTTTC GTGTAGCATC AGTACGAGTA CGTCTATrTC CGTCCATTCG 

35601 TCCGGAACCA CCACAGAAAA AGACACCATT TTTCTCTCAA ACAT6TCTGC 
AGGCCTTGGT GGTGTCTTTT TCTGTGGTAft AAAGAGAGTT TGTACAGACG 

35651 GGGTTTCTGC ATAAACACAA AATAAAATAA CAAAAAAACA TTTAAACATT 
CCCA7AGACG TATTTGTGTT TTATTTTATT GTTTTTTTGT AAATTTGTAA 

35701 AGAAGCCTGT CTTACAACAG GAAAAACAAC CCTTATAAGC ATAAGACGGA 
TCTTCGGACA GAATGTTGTC CTTTTTGTTG GGAATATTCG TATTCTGCCT 

35751 .CTACGGCCAT GCCGGCGTGA CCGTAAAAAA ACTGGTCACC GTGATTAAAA 
6ATGCCGGTA CGGCCGCACT 6GCATTTTTT TGACCAGTGG CACTAATTTT 

35B01 AGCACCACCG ACAGCTCCTC GGTCATGTCC GGAGTCATAA TGTAAGACTC 
TCGTGGTGGC TGTCGAGGAG CCAGTACAGG CCTCAGTATT ACATTCTGAG 

35851 GGTAAACACA TCAGGTTGAT TCACATCGGT CAGTGCTAAA AAGCGACCGA 
CCATTT6TGT AGTCCAACTA AGTGTAGCCA GTCACGATTT TTCGCTGGCT 

35901 AATAGCCCGG GGGAATACAT ACCCGCAGGC GTAGAGACAA CATTACAGCC 
TTATCGGGCC CCCTTATGTA TGGGCGTCCG CATCTCTGTT GTAAT6TCGG 

35951 CCCATAGGAG GTATAACAAA ATTAATAGGA GAGAAAAACA CATAAACACC 
G6GTATCCTC CATATTGTTT TAATTATCCT CTCTTTTTGT GTATTTGTGG 

36001 TGAAAAACCC TCCTGCCTAG GCAAAATA6C ACCCTCCCGC TCCAGAACAA 
ACTTTTTGGG AGGACGGATC CGTTTTATCG TGCGAGGGCG AGGTCTTGTT 
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36051 CATACAGCGC Til^IACAGCG GCAGCCATAA CAGTCAGCCT TACCAG^^ 
GTATGTCGCG AAGGTGTCGC CGTCGGTATT GTCAGTCGGA ATGGTCATTT 

36101 AAA6AAAACC TATTAAAAAA ACACCACTCG ACACGGCACC A6CTCAATCA 
TTTCTTTTG6 ATAATTTTTT T6TGGTGA6C TGTGCCGTGG TCGAGTTAGT 

36151 GTCACAGTGT AAAAAAGGGC CAAGTGCAGA GCGAGTATAT ATAGGACTAA 
CAGTGTCACA TTTTTTCCCG GTTCACGTCT CGCTCATATA TATCCTGATT 

36201 AAAATGACGT AACGGTTAAA GTCCACAAAA AACACCCAGA AAACCGCACG 
TTTTACTGCA TTGCCAATTT CAGGTGTTTT TTGTGGGTCT TTTGGCGTGC 



36251 CGAACCTACG CCCAGAAACG AAA6CCAAAA AACCCACAAC TTCCTCAAAT 
GCTTGGATGC GGGTCTTTGC TTTCGGTTTT TTGGGTGTTG AAGGAGTTTA 

36301 CGTCACTTCC GTTTTCCCAC GTTACGTCAC TTCCCATTTT AAGAAAACTA 
GCAGTGAAGG CAAAAGGGTG CAATGCAGTG AAGGGTAAAA TTCTTTTGAT 

35351 CAATTCCCAA CACATACAAG TTACTCCGCC CTAAAACCTA CGTCACCCGC 
GTTAAGGGTT GTGTATGTTC AATGAGGCGG GATTTTGGAT GCAGTGGGCG 

36401 CCCGTTCCCA CGCCCCGCGC CACGTCACAA ACTCCACCCC CTCATTATCA 
GGGCAAGGGT GCGGGGCGCG GTGCAGTGTT TGAGGTGGGG GAGTAATAGT 

Pad 



36451 TATTGGCTTC AATCCAAAAT AAGGTATATT ATTGATGATG TTAATTAAGA 
ATAACCGAAG TTAGGTTTTA TTCCATATAA TAACTACTAC AATTAATTCT 

36501 ATTCGGATCT GCGACGCGAG GCTGGATGGC CTTCCCCATT ATGATTCTTC 
TAAGCCTAGA CGCTGCGCTC CGACCTACCG GAAGGGGTAA TACTAAGAAG 

36551 TCGCTTCCGG CGGCATCGGG ATGCCCGCGT TGCAGGCCAT GCTGTCCAGG 
AGCGAAGGCC GCCGTAGCCC TACGGGCGCA ACGTCCGGTA CGACAGGTCC 

36601 CAGGTAGATG ACGACCATCA GGGACAGCTT CAAGGCCAGC AAAAGGCCAG 
GTCCATCTAC TGCTGGTAGT CCCTGTCGAA GTTCCGGTCG TTTTCCGGTC 

36651 GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC 
CTTGCCATTT TTCCGGCGCA ACGACCGCAA AAAGGTATCC GAGGCGGGGG 

36701 CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG 
GACTGCTCGT AGTGTTTTTA GCTGCGAGTT CAGTCTCCAC CGCTTTGGGC 

36751 ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG 
TGTCCTGATA TTTCTATGGT CCGCAAAGGG GGACCTTCGA GGGAGCACGC 

36801 CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 
GAGAGGACAA GGCTGGGACG GCGAATGGCC TATGGACAGG CGGAAAGAGG 

36851 CTTCGGGAAG CGTGGCGCTT TCTCATAGCT CAC6CTGTAG GTATCTCAGT 
GAAGCCCTTC GCACC6CGAA AGAGTATC6A GTGCGACATC CATAGAGTCA 

36901 TC6GTGTAGG TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT 
AGCCACATCC AQCAAGCGAG GTTC6ACCC6 ACACACGTGC TTGGGGGGCA 
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3S951 TCACCCCGAC q^fXSCGCCT TATCCGGTAA CTATCGTCTT' GAGTCQMCC 
AGTCGGGCTG GTO^CGCGGA ATAGGCCATT GATAGCAGAA CTCAGG^TOG 

37002 CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT 
GCCATTCTGT GCTGAATAGC GGTGACCGTC GTCGGTGACC ATTGTCCTAA 

37051 AGCAGAGCGA GGTATGTAGG CGGTGC7ACA GAGTTCTTGA A6TGGTGGCC 
TCGTCTCGCT CCATACATCC GCCACGATGT CTCAAGAACT TCACCACCGG 

37101 TAACTACGQC TACACTAGAA GGACAGTATT TGGTATCTGC CCTCTCCTGA 
ATTGATGCCG ATGTGATCTT CCTGTCATAA ACCATAGACG CGAGACGACT 

37151 AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA 
TCGGTCAATG GAAGCCTTTT TCTCAACCAT CGAGAACTAG GCCGTTTGTT 

37201 ACCACCGCTG GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG 
TGOTGCCGAC CATCGCCACC AAAAAAACAA ACGTTCGTCG TCXAATGCGC 

37251 CAGAAAAAAA GGATCTCAA6 AAGATCCTTT GATCTTTTCT ACGGGGTCTG 
GTCTTTTTTT CCTAGAGTTC TTCTAG6AAA CTAGAAAAGA TGCCCCAGAC 

37301 ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA 
TGCGAGTCAC CTTGCTTTTG AGTGCAATTC CCTAAAACCA 6TACTCTAAT 

37351 'TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATCAATCTA AAGTATATAT 
AGTTTTTCCT AGAAGTGGAT CTAGGAAAAT TTAGTTAGAT TTCATATATA 

37401 GAGTAAACTT GGTCTGACAG TTACCAATGC TTAATCAGTG AGGCACCTAT 
CTCATTTGAA CCAGACTGTC AATGGTTACG AATTAGTCAC TCCGTGGATA 

37451 CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTGCCTGA CTCCCCGTCG 
GAGTCGCTAG ACAGATAAAG CAAGTAGGTA TCAACGGACT GAGGGGCAGC 

37501 TGTAGATAAC TACGATACGG 6AGGGCTTAC CATCTGGCCC CAGTGCTGCA 
ACATCTATTG ATGCTATGCC CTCCCGAATG GTAGACCGGG GTCACGACGT 

37551 ATGATACCGC GAGACCCACG CTCACCGGCT CCAGATTTAT CAGCAATAAA 
TACTATGGCG CTCTGGGTGC GAGTGGCCGA GGTCTAAATA GTCGTTATTT 

37 601 CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG TGGTCCTGCA ftCTTTATCCG 
GGTCGGTCGG CCTTCCCGGC TCGCGTCTTC ACCAGGACGT TGAAATAGGC 

37651 CCTCCATCCA GTCTATTMT TGTTGCCGGG AAGCTAGAGT AAGTAGTTCG 
G6AGGTAGGT CAGATAATTA ACAACGGCCC TTC6ATCTCA TTCATCAAGC 

37701 CCAGTTAATA GTTTGCGCAA CGTTGTTGCC ATTGCTACAG GCATCGTG6T 
6GTCAATTAT CAAACGCGTT GCAACAACGG TAACGATGTC CGTAGCACCA 

37751 GTCACGCTCG TCGTTTGGTA TGGCTTCATT CAGCTCCGGT TCCCAACGAT 
CAGTGCGAGC AGCAAACCAT ACCGAAGTAA GTCGAGGCCA AGGGTTGCTA 

37801 CAAGGCGAGT TACATGATCC CCCATGTTGT GCAAAAAAGC GGTTAGCTCC 
6TTCCGCTCA ATGTACTAGG GGGTACAACA CGTTTTTTCG CCAATCGAGG 

37B51 TTCGGTCCTC CGATCGTTGT CAGAAGTAAG TTGGCCGCAG TGTTATCACT 
AACCCAGGAG GCTAGCAACA GTCTTCATTC AACCGGCGTC ACAATAGTGA 
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37901 



CATGGTTATG 




lCTGC ATAATTCTCT TACTGTCATG'*t:CATC( 



GTACCAATAC cWCGTGACG TATTAAGAGA ATGACAGTAC GGTAGGWtT 

37951 GATGCTTTTC TGTGACT6GT GAGTACTCAA CCAAGTCATT CTGAGAATAG 
CTACGAAAAG ACACTGACCA CTCATGAGTT GGTTCAGTAA GACTCTTATC 

38001 TGTATGCGGC GACCGAGTTG CTCTTGCCCG GCGTCAACAC GGGATAATAC 
ACATACGCCG CTGGCTCAAC GAGAACGGGC CGCAGTTGTG CCCTATTATG 

38051 CGCGCCACAT AGCAGAACTT TAAAAGTGCT CATCATTGGA AAACGTTCTT 
GCGCGGTGTA TCGTCTT6AA ATTTTCAC6A GTAGTAACCT TTTGCAAGAA 

38101 CGGGGCGAAA ACTCTCAAGG ATCTTACCGC TGTTGAGATC CAGTTCGATG 
GCCCCGCTTT TGAGAGTTCC TAGAATGGCG ACAACTCTAG GTCAAGCTAC 

38151 TAACCCACTC GTGCACCCAA CTGATCTTCA GCATCTTTTA CTTTCACCAG 
ATTGGGTGAG CACGTGGGTT GACTAGAAGT CGTAGAAAAT GAAAGTGGTC 

38201 CGTTTCTGGG TGAGCAAAAA CAGGAAGGCA AAATGCCGCA AAAAAGGGAA 
6CAAAGACCC ACTCGTTTTT GTCCTTCCGT TTTACGGCGT TTTTTCCCTT 

38251 TAAGG6CGAC AC6GAAATGT TGAATACTCA TACTCTTCCT TTTTCAATAT 
ATTCCCGCTG TGCCTTTACA ACTTATGA6T ATGAGAAGGA AAAAGTTATA 

38301 TATTGAAGCA TTTATCA6GG TTATTGTCTC ATGAGCGGAT ACATATTTGA 
ATAACTTCGT AAATAGTCCC AATAACAGAG TACTCGCCTA TGTATAAACT 

3B351 ATGTATTTAG AAAAATAAAC AAATAGGGGT TCCGCGCACA TTTCCCCGAA 
TACATAAATC TTTTTATTTG TTTATCCCCA AGGCGCGTGT AAAGGGGCTT 

3B401 AAGTGCCACC TGACGTCTAA GAAACCATTA TTATCATGAC ATTAACCTAT 
TTCACGGTGG ACTGCAGATT CTTTGGTAAT AATAGTACTG TAATTGGATA 

38451 AAAAATAGGC GTATCACGAG GCCCTTTCGT CTTCAAGAAT TGGATCCGAA 
TTTTTATCCG CATAGTGCTC CGGGAAAGCA GAAGTTCTTA ACCTAGGCTT 



Pad 



38501 



TTCTTAATTT CTTAATTAA (SEQ ID NO: 32) 
AAGAATTAAA GAATTAATT {SEQ ID NO: 33) 
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MRKAd3nefMER1063 
(MRKAdS Pre-Adenoviral Vector Containing the G2A»LLA nef Coding Region) 



1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 
GTAGTAGTTA TTATATGGAA TAAAACCTAA CTTCGGTTAT ACTATTACTC 

51 GGGGTGGAGT TTGT6ACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 
CCCCACCTCA AACACTGCAC CGCGCCCCGC ACCCTTGCCC CGCCCACTGC 

101 TAGTAGTGTG GCX^GAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 
ATCATCACAC CGCCTTCACA CTACAACGTT CACACCGCCT TGTGTACATT 

151 GCGACGGATG TGGCAAAAGT GACGTTTTTG GT6TGCGCCG GTGTACACAG 
CGCTGCCTAC ACCGTTTTCA CTGCAAAAAC CACACGC6GC CACATGTGTC 

201 GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 
CTTCACTGTT AAAAGCGCGC CAAAATCCGC CTACT^CATC ATTTAAACCC 

251 CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
GCATT6GCTC ATTCTAAACC GGTAAAAGCG CCCTTTTGAC TTATTCTCCT 

301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 
TCACTTTAGA CTTATTAAAA CACAATGAGT ATCGCGCATT ATAAACAGAT 

351 GGGCCGCGGG GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT 
CCCGGCGCCC CTGAAACTGG CAAATGCACC TCTGAGCGGG TCCACAAAAA 

401 CTCAGGTGTT TTCCGCGTTC CGGGTCAAAG TTGGCGTTTT ATTATTATAG 
GAGTCCACAA AAGGCGCAAG GCCCAGTTTC AACCGCAAAA TAATAATATC 

*451 GCGGCCGCGA TCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT 
CGCCGGCGCT AGGTAACGTA TGCAACATAG GTATAGTATT ATACATGTAA 

501 TATATTGGCT CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC 
ATATAACCGA GTACAGGTTG TAATGGCGGT ACAACTGTAA CT/IATAACTG 

551 TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
ATCAATAATT ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT 

601 TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG 
ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC 

651 CCCAAC6ACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 
GGGTTGCT6G GGGCGGGTAA CTGCAGTTAT TACTGCATAC AAGGGTATCA 

701 AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT 
TTGCGGTTAT CCCTGAAAGG TAACTGCAGT TACCCACCTC ATAAATGCCA 

•751 AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 
TTTGACGG6T GAACCGTCAT GTAGTTCACA TAGTATACGG TTCAT6CG6G 

801 CCTATTGACG TCAAT6ACGG TAAAT6GCCC GCCTGGCATT ATGCCCAGTA 
GGATAACTGC AGTTACTGCC ATTTACCGGG CQGACCGTAA TACGGGTCAT 
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851 CATGACCTTA Tj^BACTTTC CTACTTGGCA QTACATCTAC GTATTSMcA " 
GTACTGGAAT .^RtGAAAG GATGAACCGT CATGTAGATG CATAA'tcRt 

SOI TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA 
AGCGATAATG GTACCACTAC GCCAAAACCG TCATGTAGTT ACCCGCACCT 

951 TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 
ATCGCCAAAC TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT 

1001 TGGGAGTTTG TTTTGCCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 
ACCCTCAAAC AAAACCGTGG TTTTAGTTGC CCTGAAAGGT TTTACAGCAT 

1051 ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG 
TGTTGAGGCG GGGTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC 

1101 GTCTATATAA GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG 
CAGATATATT CGTCTCGAGC AAATCACTTG GCAGTCTAGC GGACCTCTGC 

1151 CCATCCACGC TGTTTTGACC . TCCATAGAAG ACACCGGGAC CGATCCAGCC 
GGTAGGTGCG ACAAAACTGG AGGTATCTTC TGTGGCCCTG GCTAGGTCGG 

1201 TCCGCGGCCG GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT 
AGGCGCCGGC CCTTGCCACG TAACCTTGCG CCTAAGGGGC ACGGTTCTCA 

1251 GAGATCTGCC ACCATGGCCG GCAAGTGGTC CAAGAGGTCC GTGCCCGGCT 
CTCTAGACGG TGGTACCGGC CGTTCACCAG GTTCTCCAGG CACGGGCCGA 

1301 GGTCCACCGT GAGGGAGAGG ATGAGGAGGG CCGAGCCCGC CGCCGACAGG 
CCAGGTGGCA CTCCCTCTCC TACTCCTCCC GGCTCGGGCG GCGGCTGTCC 

1351 GTGAGGAGGA CCGAGCCCGC CGCAGTGGGC GTGGGCGCCG TGTCCAGGGA 
CACTCCTCCT GGCTCGGGCG GCGTCACCCG CACCCGCGGC ACAGGTCCCT 

1401 CCTGGAGAAG CACGGCGCCA TCACCTCCTC CAACACCGCC GCCACCAACG 
GGACCTCTTC GTGCCGCGGT AGTGGAGGAG GTTGTGGCGG CGGTGGTTGC 

IflSl CCGACTGCGC CTGGCTGGAG GCCCAGGAGG ACGAGGAGGT GGGCTTCCCC 
GGCTGACGCG GACCGACCTC CGGGTCCTCC TGCTCCTCCA CCCGAAGGGG 

1501 GTGAGGCCCC AGGTGCCCCT GAGGCCCATG ACCTACAAGG GCGCCGTGGA 
CACTCCGGGG TCCACGGGGA CTCCGGGTAC TGGATGTTCC CGCGGCACCT 

1551 CCTGTCCCAC TTCCTGAAGG AGAAGGGCGG CCTGGAGGGC CTGATCCACT 
GGACAGGGTG AAGGACTTCC TCTTCCCGCC GGACCTCCCG GACTAGGTGA 

X601 CCCAGAAGAG GCAGGACATC CTGGACCTGT GGGTGTACCA CACCCAGGGC 
GGGTCTTCTC CGTCCTGTAG GACCTGGACA CCCACATGGT GTGGGTCCCG 

1651 TACTTCCCCG ACTGGCAGAA CTACACCCCC GGCCCCGGCA TCAGGTTCCC 
ATGAAQGGGC TGACCGTCTT GATGTGGGGG CCGGGGCCGT AGTCCAAGGG 

1701 CCTGACCTTC GGCTGGTGCT TCAAGCTGGT GCCCGTGGAG CCCGAGAAGG 
GGACTGGAAG CCGACCACGA AGTTCGACCA C66GCACCTC CCGCTCTTCC 

1751 TGGAGGAGGC CAACGAGGGC GAGAACAACT GCGCCGCCCA CCCCATGTCC 
ACCTCCTCCG GTTGCTCCCG CTCTTGTTGA CGCGGCGGGT GGGGTACAGG 



97/144 



wo 02/22080 



PCTAJSOl/28861 



I80I CAGCACGGCA r!|^GGACCC CGAGAAGGAG GTGCTGGAGT GGAGGOj^KA 
GTCGTGCCGT aRtCCTGGG GCTCTTCCTC CACGACCTCA CCTCCwRt 

1851 CTCCAAGCTG GCCTTCCACC ACGTGGCCAG GGAGCTGCAC CCCGAGTACT 
GAGGTTCGAC CGGAAGGTGG TGCACCGGTC CCTCGACGTG GGGCTCATGA 

1901 ACAAGGACTG CTAAAGCCCG GGCAGATCTG CTGTGCCTTC TAGTTGCCAG 
TGTTCCTGAC GATTTCGG6C CCGTCTAGAC GACACGGAAG ATCAACGGTC 

1951 CCATCTGTT6 TTTGCCCCTC CCCCGTGCCT TCCTTGACCC TG6AAGGTGC 
GGTAGACAAC AAACGGGGA6 GGGGCACG6A AGGAACTGGG ACCTTCCACG 

2001 CACTCCCACT GTCCTTTCCT AATAAAATGA GGAAATTGCA TCGCATTGTC 
GTGAGGGTGA CAGGAAAGGA TTATTTTACT CCTTTAACGT AGCGTAACAG 

2051 TGAGTAGGTG TCATTCTATT CTGGGGGGTG GGGTGGGGCA GGACAGCAAG 
ACTCATCCAC ACTAAGATAA GACCCCCCAC CCCACCCCGT CCTGTCGTTC 

2101 GGGGAGGATT GGGAAGACAA TAGCAGGCAT GCTGGGGATG CGGTGGGCTC 
CCCCTCCTAA CCCTTCTGTT ATCGTCCGTA CGACCCCTAC GCCACCCGAG 

2151 TATGGCCGAT CGGCGCGCCG TACTGAAATG TGTGGGCGTG GCTTAAGGGT 
ATACCGGCTA GCCGCGC6GC ATGACTTTAC ACACCCGCAC CGAATTCCCA 

2201 GCGAAACAAT ATATAAGGTG GGGGTCTTAT GTAGTTTTGT ATCTGTTTTG 
CCCTTTCTTA TATATTCCAC CCCCAGAATA CATCAAAACA TAGACAA7AC 

2251 CAGCAGCCGC CGCCGCCATG AGCACCAACT CGTTTGATGG AAGCATTGTG 
GTCGTCGGCG GCGGCGGTAC TCGTGGTTGA GCAAACTACC TTCGTAACAC 

2301 AGCTCATATT TGACAACGCG CATGCCCCCA TGGGCCGGGG TGCGTCAGAA 
TCGAGTATAA ACTGTTGCGC GTACGGGGGT ACCCGGCCCC ACGCAGrCTT 

2351 TGTGATGGGC TCCAGCATTG ATGGTCGCCC CGTCCTGCCC GCAAACTCTA 
ACACIACCCG AGGTCGTAAC TACCAGCGGG GCAGGACGGG CGTTTGAGAT 

2401 CTACCTTGAC CTACGAGACC GTGTCTGGAA CGCCGTTGGA GACTGCAGCC 
GATGGAACTG GATGCTCTGG CACAGACCTT GCGGCAACCT CT6ACGTCGG 

2451 TCCGCCGCCG CTTCAGCCGC TGCA6CCACC GCCCGCG6GA TTGTGACTGA 
AG6CGGCGGC GAAGTC6GC6 ACGTCGGT6G CGG6CGCCCT AACACTGACT 

2501 CTTTGCTTTC CTGAGCCCGC TTGCAAACAG TGCAGCTTCC CGTTCATCCG 
GAAACGAAAG GACTCGGGCG AACGTTTGTC ACGTCGAAGG GCAAGTAGGC 

2551 CCCGCGATGA CAAGTTGACG GCTCTTTTGG CACAATTGGA TTCTTTGACC 
GGGCGCTACT GTTCAACTGC CGAGAAAACC GTGTTAACCT AAGAAACTGG 

2601 CGGGAACTTA ATGTCGTTTC TCAGCAGCTG TTGGATCTGC GCCAGCAGGT 
6CCCTTGAAT TACAGCAAAG AGTCGTCGAC AACCTAGACG CGGTCGTCCA 

2651 TTCTGCCCTG AAG6CTTCCT CCCCTCCCAA TGCGGTTTAA AACATAAATA 
AAGACGGGAC TXCCGAAGGA GGGGAGGGTT ACGCCAAATT TTGTATTTAT 

2701 AAAAACCAGA CTCT6TTTGG ATTTGGATCA AGCAAGTGTC TTGCTGTCTT 
TTTTTGGTCT GAGACAAACC TAAACCTAGT TCGTTCACAG AACGACAGAA 
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2751 TATTTAGGGG tW^GCGCGC GCGGTAGGCC CGGGACCAGC GGTCTCSITC 
ATAAATCCCC AAAACGCGCG CGCCATCCGG 6CCCT6GTCG CCAGAGCCAG 

2801 GTTGAGGGTC CTGTGTATTT TTTCCAGGAC GTGGTAAAGG TGACTCTGGA 
CAACTCCCAG GACACATAAA AAAGGTCCTG CACCATTTCC ACT6AGACCT 

2851 TGTTCAGATA CATGGGCATA AGCCCGTCTC TG6GGTGGAG GTAGCACCAC 
ACAAGTCTAT 6TACCCGTAT TCGGGCAGAG ACCCCACCTC CATCGTGGTG 

2901 TGCAGAGCTT CATGCTGCGG GGTGGTGTTG TAGATGATCC AGTCGTAGCA 
ACGTCTCGAA GTACGAC6CC CCACCACAAC ATCTACTAGG TCAGCATCGT 

2951 GGAGCGCTGG GCGTG6TGCC TAAAAATGTC TTTCAGTAGC AAGCTGATTG 
CCTCGCGACC CGCACCACGG ATTTTTACAG AAAGTCATCG TTCGACTAAC 

3001 . CCAGGGGCAG CCCCTTGGTG TAAGTCmTA CAAAGCGGTT AAGCTGGGAT 
G6TCCCCGTC CGGGAACCAC ATTCACAAAT GTTTCGCCAA TTCGACCCTA 

3051 GGGT6CATAC 6TGG6GATAT GAGATGCATC TTGGACTGTA TTTTTAGGTT 
CCCACGTATG CACCCCTATA CTCTACGTAG AACCTGACAT AAAAATCCAA 

3101 GGCTATGTTC CCAGCCATAT CCCTCCGGGG ATTCATGTTG TGCAGAACCA 
CCGATACAAG GGTCGGTATA GGGAGGCCCC TAAGTACAAC ACGTCTTGGT 

3151 CCAGCACAGT GTATCCGGTG CACTTGGGAA ATTTGTCATG TAGCTTAGAA 
GGTCGTGTCA CATAGGCCAC GTGAACCCTT TAAACAGTAC ATCGAATCTT 

3201 GGAAATGCGT GGAAGAACTT GGAGACGCCC TTGTGACCTC CAAGATTTTC 
CCTTTACGCA CCTTCTTGAA CCTCTGCGGG AACACTGGAG GTTCTAAAAG 

3251 CATGCATTCG TCCATAATGA TG6CAAT6GG CCCACGGGCG GCGGCCTGGG 
GTACGTAAGC AGGTATTACT ACCCTTACCC G6GTGCCC6C CGCCGGACCC 

3301 CGAAGATATT TCTGGGATCA CTAACGTCAT AGTTGTGTTC CAGGATGAGA 
GCTTCTATAA AGACCCTAGT GATTGCAGTA TCAACACAAG GTCCTACTCT 

3351 TCGTCATAGG CCATTTTTAC AAAGCGCGGG CGGAGGGTGC CAGACTGCGG 
AGCAGTATCC GGTAAAAATG TTTCGCGCCC GCCTCCCACG GTCTGACGCC 

3401 TATAATGGTT CCATCCGGCC CAGGGGCGTA GTTACCCTCA CAGATTTGCA 
ATATTACCAA GGTAGGCCG6 GTCCCCGCAT CAATGGGAGT GTCTAAACGT 

3451 TTTCCCACGC TTTGAGTTCA GATGGGGGGA TCATGTCTAC CTGCGGGGCG 
AAAGGGTGCG AAACTCAAGT CTACCCCCCT AGTACAGATG GACGCCCCGC 

3501 ATGAAGAAAA CGGTTTCCGG GGTAGGGGA6 ATCAGC7GGG AAGAAA6CAG 
TACTTCTTTT GCCAAAGGCC CCATCCCCTC TAGTCGACCC TTCTTTCGTC 

3551 GTTCCTGAGC AGCTGC6ACT TACCGCAGCC GGTGGGCCCG TAAATCACAC 
CAAGGACTCG TCGACGCTGA ATGGCGTCGG CCACCCGGGC ATTTAGTGTG 

3601 CTATTACCGG CTGCAACTGG TAGTTAAGAG AGCTGCAGCT GCCGTCATCC 
GATAATGGCC GACGTTGACC ATCAATTCTC TCGACGTCGA CGGCAGTAGG 

3651 CTGAGCAGGG GGGCCACTTC GTTAAGCATG TCCCTGACTC GCATGTTTTC 
QACTCGTCCC CCCGGTGAAG CAATTCGTAC ACGGACTCAG CGTACAAAAG 
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3701 CCTGACCAAA 'flfeCCAGAA GGC6CTC6CC OCCCAGCGAT AGCAG^^B^ 
GGACTGGTTT AGGCGGTCTT CCGCGAGCGG CGGGTCGCTA TCGTCAAGAA 

3751 GCAAGGAAGC AAAGTTTTTC AAC6GTTTGA GACCGTCCGC CGTAGGCATG 
CGTTCCTTCG TTTCAAAAAG TTGCCAAACT CTGGCAGGCG GCATCC6TAC 

3B01 CTTTTGAGCG TTTGACCAAG CAGTTCCAGG CGGTCCCACA GCTCGGTCAC 
GAAAACTCGC AAACTGGTTC GTCAAGGTCC GCCAGGGTGT CGAGCCAGTG 

3851 CTGCTCTACG GCATCTCGAT CCAGCATATC TCCTCGTTTC GCGGGTTGGG 
GACGAGATGC CGTAGAGCTA GGTCGTATAG AGGAGCAAAG CGCCCAACCC 

3901 GCG6CTTTC6 CTGTACGGCA GTAGTCG6TG CTCGTCCAGA CGGGCCA66G 
CGCCGAAAGC GACATGCC6T CATCAGCCAC GAGCAGGTCT 6CCCGGTCCC 

3951 TCATGTCTTT CCACGGGOGC AGGGTCCTCG TCAGCCTAGT CTGGGTCACG 
AGTACAGAAA GGTGCCC6CG TCCCAGGA6C AGTCGCATCA GACCCAGTGC 

4001 GTGAAGGGGT GCGCTCCGGG CTGCGCGCTG GCCAGGGTGC GCTTGAGGCT 
CACTTCCCCA CGCGAGGCCC GACGCGCGAC CGGTCCCACG CGAACTCCGA 

4051 GGTCCTGCTG GTGCTGAAGC GCTGCCGGTC TTCGCCCTGC GCGTCGGCCA 
CCAGGACGAC CACGACTTCG C6ACGGCCAG AAGCGGGACG CGCAGCC6GT 

4101 GGTAGCATTT GACCATGGTG TCATAGTCCA GCCCCTCCGC GGCGTGGCCC 
CCATCGTAAA CTGGTACCAC AGTATCAGGT CGGGGAGGCG CCGCACCGGG 

4151 TTGGCGCGCA GCTTGCCCTT GGAGGAGGCG CCGCACGAGG GGCAGTGCAG 
AACCGCGCGT CGAACGGGAA CCTCCTCCGC GGCGTGCTCC CCGTCACGTC 

4201 ACTTTTGAGG GC6TAGAGCT TGGGCGCGAG AAATACCGAT TCCGGGGAGT 
TGAAAACTCC CGCATCTCGA ACCCGCGCTC TTTATGGCTA AGGCCCCTCA 

4251 AGGCATCCGC GCCGCAGGCC CCGCAGACGG TCTCGCATTC CACGAGCCAG 
TCCGTAGGCG CGGCGTCCGG GGCGTCTGCC AGAGCGTAAG GTGCTCGGTC 

4301 GTGAGCTCTG GCCGTTCGGG GTCAAAAACC AGGTTTCCCC CATGCTTTTT 
CACTCGAGAC CGGCAAGCCC CAGTT7TTGG TCCAAA6GGG GTACGAAAAA 

4351 GATGCGTTTC TTACCTCTGG TTTCCATGAG CCGGTGTCCA CGCTCGGTGA 
CTACGCAAAG AATGGAGACC AAAGGTACTC GGCCACAGGT GCGAGCCACT 

4401 CGAAAAGGCT GTCCGTGTCC CCGTATACAG ACTTGAGAGG CCTGTCCTCG 
GCTTTTCCGA CAGGCACAGG GGCATATGTC TGAACTCTCC GGACAGGAGC 

4451 AGCGGTGTTC CGCGGTCCTC CTCGTATAGA AACTCGGACC ACTCTGAGAC 
TCGCCACAAG GCGCCAGGAG GAGCATATCT TTGAGCCTGG TGAGACTCTG 

4501 AAAGGCTCGC GTCCAGGCCA GCACGAAGGA GGCTAAGTGG GAGGGGTAGC 
TTTCCGAGCG CAGGTCCGGT CGTGCTTCCT CCGATTCACC CTCCCCATCG 

4551 GGTCGTTGTC CACTAGG66G TCCACTCGCT CCAGGGTGTG AA6ACACATG 
" CCAGCAACAG GTGATCCCCC AGGTGAGCGA GGTCCCACAC TTCTGTGTAC 

4601 TCGCCCTCTT CGOCATCAAG GAAGGTGATT GGTTTGTAGG TGTAGGCCAC 
AGCGGGAGAA GCCGTAGTTC CTTCCACTAA CCAAACATCC ACATCCGGTG 
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4651 GTGACCGGGT ^tCTGAAG GG6GGCTATA AAAGGG6GTG GGGGC^V^ 
CACTCGCCCA CAAGGACTTC CCCCCGATAT TTTCCCCCAC CCCCGCGCAA 

4701 CGTCCTCACT CTCTTCCGCA TCGCTGTCTG CGAGGGCCAG CTGTTGGGGT 
GCAGGAGTGA GAGAAGGCGT AGCGACAGAC GCTCCCGGTC GACAACCCCA 

4751 GAGTACTCCC TCTGAAAAGC GGGCATGACT TCTGCGCTAft GATTGTCAGT 
CTCATGAGGG AGACTTTTCG CCCGTACTGA AGACGCGATT CTAACAGTCA 

4B01 TTCCAAAAAC GAGGAGGATT TGATATTCAC CTGGCCCGCG GTGATGCCTT 
AAGGTTTTTG CTCCTCCTAA ACTATAAGTG GACCGGGCGC CACTACGGAA 

4851 TGAG6GTGGC CGCATCCATC TGGTCAGAAA AGACAATCTT TTTGTTGTCA 
ACTCCCACCG GCGTAGGTAG ACCAGTCTTT TCTGTTAGAA AAACAACAGT 

4901 AGCTTGGTGG CAAACGACCC GTAGAGGGCG TT6GACAGCA ACTTGGCGAT 
TCGAACCACC GTTTGCTGGG CATCTCCCGC AACCTGTCGT TGAACCGCTA 

4951 GGAGCGCAGG GTTTGGTTTT TGTCGCGATC GGCGCGCTCC TTGGCCGCGA 
CCTCGCGTCC CAAACCAAAA ACAGCGCTAG CCGCGCGAGG AACCGGCGCT 

5001 . TGTTTAGCTG CACGTATTCG CGCGCAACGC ACCGCCATTC GGGAAAGACG 
ACAAATCGAC 6TGCATAAGC GCGCGTTGCG TGGCGGTAAG CCCTTTCTGC 

5051 GTGGTGCGCT CGTCGGGCAC CAGGTGCACG CGCCAACCGC GGTTGTGCAG 
CACCACGCGA GCAGCCCGTG GTCCACGTGC GCGGTTGGCG CCAACACGTC 

5101 GGTGACAAGG TCAACGCTGG TG6CTACCTC TCCGCGTAGG CGCTCGTTGG 
CCACTGTTCC AGTTGCGACC ACCGATGGAG AGGCGCATCC GCGAGCAACC 

5151 TCCAGCAGAG GCGGCCGCCC TTGCGCGAGC AGAATGGCGG TAGGGGGTCT 
AGGTCGTCTC CGCCGGCGGG AACGCGCTCG TCTTACCGCC ATCCCCCAGA 

5201 AGCTGCGTCT CGTCCGGGGG GTCTGCGTCC ACGGTAAAGA CCCCGGGCAG 
TCGACGCAGA GCAGGCCCCC CAGACGCAGG TGCCATTTCT GGGGCCCGTC 

5251 CAGGCGCGCG TCGAAGTAGT CTATCTTGCA TCCTTGCAAG TCTAGCGCCT 
GTCCGCGCGC AGCTTCATCA GATAGAACGT AGGAACGTTC AGATCGC6GA 

5301 GCTGCCATGC OCGGGCGGCA AGCGCGCGCT CGTATGGGTT 6AGTGGGGGA 
CGACGGTACG C6CCCGCCGT TCGCGCGC6A GCATACCCAA CTCACCCCCT 

5351 CCCCATGGCA TGGGGTGGGT GAGCGCGGftG GCGTACATGC CGCAAATGTC 
GGGGTACCGT ACCCCACCCA CTCGC6CCTC CGCATGTACG QCGTTTACAG 

5401 GTAAACGTAG AGGGGCTCTC TGAGTATTCC AAGATATGTA GGGTAGCATC 
CATTTGCATC TCCCC6AGAG ACTCATAAGG TTCTATACAT CCCATCGTAG 

5451 TTCCACCGCG GATGCTGGCG CGCACGTAAT CGTATAGTTC GTGCGAGGGA 
AAGGTGGCGC CTACGACCGC GCGTGCATTA GCATATCAAG CACGCTCCCT 

5501 GC6AGGAGGT CGGGACC6AG GTTGCTACGG GCGGGCTGCT CTGCTCGGAA 
CGCTCCTCCA 6CCCTCGCTC CAACGATGCC CGCCCGACGA GACGAGCCTT 

5551 GACTATCTGC CTGAAGATGG CATGTGAGTT GGATGATATG GTTGGACGCT 
CTGATAGACG GACTTCTACC GTACACTCAA CCTACTATAC CAACCTGCGA 
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5601 GGAAGACGTT ^fccTGGCG TCTGTGAGAC CTACCGCGTC ftCGCAOM^G 
CCTTCTGCAA cWCGACCGC AGACACTCTG GATGGCGCAG TCCGTG^^ 

5 651 6AGGCGTAGG AGTCGCGCAG CTTGTTGACC AGCTCGGCGG TGACCTGCAC 
CTCCGCATCC TCAGCGCGTC 6AACAACTGG TCGA6CCGCC ACTGGACGTG 

5701 GTCTAGGGCG CAGTAGTCCA GGGTTTCCTT GATGATGTCA TACTTATCCT 
CAGATCCCGC 6TCATCAGGT CCCAAAGGAA CTACTACAGT ATGAATAGGA 

5751 GTCCCTTTTT TTTCCACAGC TCGCGGTTGA GGACAAACTC TTCGCGGTCT 
CAGGGAAAAA AAAGGTGTCG AGCGCCAACT CCTGTTTGAG AAGCGCCAGA 

5801 TTCCAGTACT CTT6GATCGG AAACCCGTCG GCCTCCGAAC GGTAAGAGCC 
AAGGTCATGA 6AACCTAGCC TTTGGGCAGC CG6AGGCTTG CCATTCTCGG 

5851 TAGCATGTAG AACTGGTTGA CGGCCTGGTA GGCGCAGCAT CCCTTTTCTA 
ATCCTACATC TTGACCAACT CCCGGACCAT CCGCGTCGTA 6GGAAAAGAT 

5901 CG6GTAGCGC GTATGCCTGC GCGGCCTTCC GGAGCGAGGT GTGGGTGAGC 
GCCCATCGCG CATACGGACG CGCC6GAAGG CCTCGCTCCA CACCCACTCG 

5951 GCAAAGGTGT CX:CTGACCAT GACTTTGAGG TACTGGTATT T6AAGTCAGT 
CGTTTCCACA GGGACTGGTA CTGAAACTCC ATGACCATAA ACTTCAGTCA 

6001 GTCGTCGCAT CCGCCCT6CT CCCAGAGCAA AAAGTCCGTG CGCTTTTTGG 
CAGCAGCGTA GGCGGGACGA QGGTCTCGTT TTTCAGGCAC GCGAAAAACC 

6051 AACGCGGATT TGGCAGGGCG AAGGTGACAT CGTTGAAGAG TATCTTTCCC 
TTGCGCCTAA ACCGTCCCGC TTCCACTGTA GCAACTTCTC ATAGAAAGGG 

6101 GCGCGAGGCA TAAAGTTGCG TGTGATGCGG AAGGGTCCCG GCACCTCGGA 
CGCGCTCCGT ATTTCAACGC ACACTACGCC TTCCCAGGGC CGTGGAGCCT 

6151 ACGGTTGTTA ATTACCTGGG CGGCGAGCAC GATCTCGTCA AAGCCGTTGA 
TGCCAACAAT TAATGGACCC GCCGCTCGTG CTAGAGCAGT TTCGGCAACT 

6201 TGTTGTGGCC CACAATGTAA AGTTCCAAGA A6CGCGGGAT GCCCTTGATG 
ACAACACCGG GTGTTACATT TCAAGGTTCT TC6CGCCCTA CGGGAACTAC 

6251 GAAGGCAATT TTTTAAGTTC CTCGTAGGT6 AGCTCTTCAG GGGAGCTGAG 
CTTCCGTTAA AAAATTCAAG GAGCATCCAC TC6AGAAGTC CCCTCGACTC 

6301 CCCGTGCTCT GAAAGGGCCC AGTCTGCAAjG ATGAGGGTTG GAAGCGACGA 
G6GCACGA6A CTTTCCCG6G TCAGACGTTC TACTCCCAAC CTTCGCTGCT 

6351 ATGAGCTCCA CAGGTCACGG GCCATTA6CA TTTGCAGGTG 6TC6C6AAAG 
TACTCGAGGT GTCCAGTGCC CGGTAATCGT AAACGTCCAC CAGCGCTTTC 

6401 GTCCTAAACT GGCGACCTAT GGCCATTTTT TCTGCGGTGA TGCAGTAGAA 
CAGGATTTGA CCGCTGGATA CCGGTAAAAA AGACCCCACT ACGTCATCTT 

6451 GGTAAGCGGG TCTTGTTCCC AGCGGTCCCA TCCAAGGTTC GCGGCTAGGT 
CCATTCGCCC AGAACAAGGG TCGCCA6GGT AGGTTCCAAG CGCCGATCCA 

6501 CTCGCGCGGC AGTCACTAQA GGCTCATCTC CGCCGAACTT CATGACCACC 
GAGCGCGCCG TCAGTGATCT CCGAGTAGAG GCGGCTTGAA GTACTGGTCG 
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6551 



ATGAAGGGCA. C^fccTGCTT CCCAAAGGCC CCCATCCAAG ThTAGC^^C 
TACTTCCCGT (sSfcGACGAA GGGTTTCCGG GGGTAGGTTC ATATCCSGXg 



6601 tacatcgtag gtgacaaaga gacgctcggt gcgaggatgc gagccgatcg 
atgtagcatc cactgtttct ctgcgagcca cgctcctacg ctcggctagc 

6651 ggaagaactg . gatctcccgc caccaattg6 aggagtggct attgatgtgg 
ccttcttgac ctagagggcg gtggttaacc tcctcaccga taactacacc 

6701 tgaaagtaga agtccctgcg acgggccgaa cactcgtgct ggcttttgta 

ACTTTCATCT TCAGGGACGC TGCCCGGCTT GTGAGCACGA CCGAAAACAT 

6751 AAAACGTGCG CAGTACTGGC AGCGGTGCAC GGGCTGTACA TCCTGCACGA 
TTTTGCACGC GTCATGACC6 TCGCCACGTG CCCGACAT6T AGGACGTGCT 

6801 GGTTGACCTG ACGACCGCGC ACAAGGAAGC AGAGTGGGAA TTTGAGCCCC 
.CCAACTGGAC TGCTGGCGCG TGTTCCTTCG TCTCACCCTT AAACTCGGGG 

6851 TCGCCTGGCG GGTTTGGCTG GTGGTCTTCT ACTTCGGCTG CTTGTCCTTG 
AGCGGACCGC CCAAACCGAC CACCAGAAGA TGAAGCCGAC GAACAGGAAC 

6901 ACCGTCTGGC TGCTCGAGGG GAGTTACGGT GGATCGGACC ACCACGCCGC 
TGGCAGACCG ACGAGCTCCC CTCAATGCCA CCTAGCCTCG TGGTGCGGCG 

6951 GCGAGCCCAA AGTCCAGATG TCCGCGCGCG GCGGTCGGAG CTTGATCACA 
CGCTCGGGTT TCAGGTCTAC AGGCGCGCGC CGCCAGCCTC GAACTACTGT 

7 001 ACATCGCGCA GATGGGAGCT GTCCATGGTC TGGAGCTCCC GCGGCGTCAG 
TGTAGCGCGT CTACCCTCGA CAGGTACCAG ACCTCGAGGG C6CCGCAGTC 

7051 GTCAGGCGGG AGCTCCTGCA GGTTTACCTC GCATAGACGG GTCAGGGCGC 
CAGTCCGCCC TCCA6GACGT CCAAATGGAG CGTATCTGCC CAGTCCCGCG 

7101 GGGCTAGATC CAGGTGATAC CTAATTTCCA GGGGCTGGTT GGTGGCGGCG 
CCCGATCTAG GTCCACTATG GATTAAAGGT CCCCGACCAA CCACCGCCGC 

7151 TCGATGGCTT GCAAGAGGCC GCATCCCCGC GGCGCGACTA CGGTACCGCG 
AGCTACCGAA CGTTCTCCGG CGTAGGGGCG CCGCGCTGAT GCCATGGCGC 

7201 CGGCGGGCGG TGGGCCGCGG GGGTGTCCTT GGATGATGCA TCTAAAAGCG 
.GCCGCCCGCC ACCCGGCGCC CCCACAGGAA CCTACTACGT AGATTTTCGC 

7251 GTGACGCGGG CGAGCCCCCG GAGGTAGGGG G6GCTCCGGA CCCGCCGGGA 
CACTGCGCCC GCTCGGGGGC CTCCATCCCC CCCGAGGCCT GGGCGGCCCT 

7301 GAGGGGGCA6 GGGCACGTCG GCGCCGCGCG CGGGCAGGAG CTGGTGCTGC 
CTCCCCCGTC CCCGTGCAGC CGC6GCGCGC GCCCGTCCTC GACCACGACG 

7351 GCGCGTAGGT TGCTGGC6AA CGCGACGACG CGGCGGTTGA TCTCCTGAAT 
CGCGCATCCA ACGACCGCTT GCGCTGCTGC GCCGCCAACT AGAGGACTTA 

7401 CTGGCGCCTC TGCGTGAAGA CGACGGGCCC GGTGAGCTTG AACCTGAAAG 
GACCGCGGAG ACGCACTTCT GCTGCCCGGG CCACTCGAAC TTGGACTTTC 

7451 AGAGTTCGAC AGAATCAATT TCGGTGTCGT TGACGGCGGC CTGGCGCAAA 
TCTCAAGCTG TCTTAGTTAA AGCCACAGCA ACTGCCGCCG GACCGCGTTT 
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7501 ATCTCCTGCA C^KrTCCTGA GTTGTCTTGA TAGGCG^eTOJl> QG<^^%^SiiMJ^ 
tagaggacgt' (SRaggact CAACAGAACT ATCCGCTAGA GCCGGT'^WT 

7551 CTGCTCGATC TCTTCCTCCT GGAGATCTCC GCGTCCGGCT cgctccacgg 
GACGAGCTAG AGAAGGAGGA CCTCTAGAGG C6CAGGCCGA GCGAGGTGCC 

7601 TGGCGGCGAG GTCGTTGGAA ATGCGGGCCA TGAGCTGCGA 6AAGGCGTT6 
ACCGCCGCTC CAGCAACCTT TACGCCCGGT ACTCGACGCT CTTCCGCAAC 

7651 AGGCCTCCCT C6TTCCAGAC GCGGCTGTAG ACCACGCCCC CTTCGGCATC 
TCCGGAGGGA GCAAGGTCTG CGCCGACATC TGGTGCGGGG GAAGCCGTAG 

7701 GCGGGCGCGC ATGACCACCT GCGCGAGATT GAGCTCCACG TGCCGGGCGA 
CGCCCGCGCG TACTG6T6GA CX^CGCTCTAA CTCGAGGTGC ACGGCCCGCT 

7751 AGACGGCGTA GTTTCGCAGG CGCTGAAAGA GGTAGTTGAG GGTGGTGGCG 
TCTGCCGCAT CAAAGCGTCC GCGACTTTCT CCATCAACTC CCACCACCGC 

7801 GTGTGTTCTG CCACGAAGAA GTACATAACC CAGCGTCGCA ACGTGGATTC 
CACACAAGAC GGTGCTTCTT CATGTATTGG GTCGCAGCGT TGCACCTAAG 

7851 GTTGATATCC CCCAAGGCCT CAAGGCGCTC CATGGCCTCG TAGAAGTCCA 
CAACTATAGG GGGTTCCGGA GTTCCGCGA6 GTACCGGAGC ATCTTCAGGT 

7901 CGGCGAAGTT GAAAAACTGG GAGTTGCGCG CCGACACGGT TAACTCCTCC 
GCCGCTTCAA CTTTTTGACC CTCAACGCGC GGCTGTGCCA ATTGAGGAGG 

7951 TCCAGAAGAC GGATGAGCTC GGCGACAGTG TCGCGCACCT CGCGCTCAAA 
AGGTCTTCTG CCTACTCGAG CCGCTGTCAC AGCGCGTGGA GCGCGAGTTT 

8001 6GCTACAGGG GCCTCTTCTT CTTCTTCAAT CTCCTCTTCC ATAAGGGCCT 
CCGATGTCCC CGGAGAAGAA GAAGAACTTA GAGGAGAAGG TATTCCCGGA 

8051 CCCCTTCTTC TTCTTCTGGC GGCGGTGGGG GAGGGGGGAC ACGGCG6CGA 
GGGGAAGAAG AAGAAGACCG CCGCCACCCC CTCCCCCCTG TGCCGCCGCT 

8101 C6ACGGCGCA CCGGGAGGCG GTCGACAAAG CGCTCGATCA TCTCCCCGCG 
GCTGCCGCGT GGCCCTCCGC CAGCTGTTTC GCGAGCTAGT AGAGGGGCGC 

8151 GCGACGGCGC ATGGTCTCGG TGACGGCGCG GCCGTTCTCG CGGGGGCGCA 
CGCTGCCGCG TACCAGAGCC ACTQCCGCQC CGGCAAGAGC GCCCCCGCGT 

8201 GTTG6AAGAC GCCGCCCGTC ATGTCCCQQT TATGGGTTGG CGGGGGGCTG 
CAACCTTCTG CGGCGGGCAG TACAGGGCCA ATACCCAACC GCCCCCCGAC 

8251 CCATGC6GCA GG6ATACGGC GCTAACGATG CATCTCAACA ATTGTTGTGT 
GGTACGCCGT CCCTATGCCG CGATTGCTAC GTAGAGTTGT TAACAACACA 

8301 AGGTACTCCG CCGCCGAGGG ACCTGAOCGA GTCCGCATCG ACC0QATC6G . 
TCCATGAGGC GGCGGCTCCC TGGACTCGCT CAGGCGTAGC TGGCCTAGCC 

8351 AAAACCTCTC GAGAAAGGCG TCTAACCAGT CACAGTCGCA AGGTAGGCTG 
. . . TTTTGGAGAG CTCTTTCCGC AGATTGGTCA GTGTCAGCGT TCCATCCGAC 

8401 AGCACCGTGG CGGGCGGCAG CGGGCGGCGG TCGGGGTTGT TTCTGGCGGA 
TCGTGGCACC GCCCGCCGTC GCCCGCCGCC AGCCCCAACA AAGACCGCCT 
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8451 GGTGCTGCTG J^KkTGrAAT TAAAGTAGGC GGTCTTGAGA CGGCGC^9G 
CCACGACGAC "f^ThCATTh ATTTCATCCG CCAGAACTCT GCCGCCTTOC 

8501 TCGACAiSAAG CACCATGTCC TTGGGTCC6G CCTGCTGAAT GCGCAGGCGG 
AGCTGTCTTC GTG6TACA6G AACCCAG6CC GGACGACTTA CGCGTCCGCC 

8551 TCGGCCATGC CCCAGGCTTC GTTTT6ACAT CGGCGCAGGT CTTTGTAGTA 
AGCCGGTACG GGGTCC6AAG CAAAACTGTA GCC6CGTCCA GAAACATCAT 

8601 GTCTTGCATG AGCCTTTCTA CCG6CACTTC TTCTTCTCCT TCCTCTTGTC 
CAGAACGTAC TCG6AAAGAT GGCCGTGAAG AAGAAGAGGA AGGAGAACAG 

8651 CTGCATCTCT TGCATCTATC GCTGCGGCGG CGGCGGAGTT TGGCCGTAGG 
GACGTAGAGA ACGTAGATA6 CGACGCCGCC GCCGCCTCAA ACCGGCATCC 

8701 TGGCGCCCTC TTCCTCCCAT GCGTGTGACC CCGAAGCCCC TCATCGGCTG 
ACCGCGGGAG AAGGAGGGTA CGCACACTGG GGCTTCGGGG A6TAGCCGAC 

8751 AAGCAGGGCT AGGTCGGCGA CAACGCGCTC GGCTAATATG GCCTGCTGCA 
TTC6TCCCGA TCCAGCCGCT GTT6CGCGAG CCGATTATAC CGGACGACGT 

8801 CCTGCGTGAG GGTAGACTG6 AAGTCATCCA TGTCCACAAA GCGGTGGTAT 
GGACGCACTC CCATCTGACC TTCAGTAGGT ACAGGTGTTT CGCCACCATA 

8851 GCGCCCGTGT TCATGGTGTA AGTGCAGTTG GCCATAACGG ACCAGTTAAC 
CGCGGGCACA ACTACCACAT TCACGTCAAC CGGTATTGCC TGGTCAATTG 

8901 GGTCTGGTGA CCCGGCTGCG AGAGCTCGGT GTACCTGAGA CGCGAGTAAG 
CCAGACCACT GGGCCGACGC TCTCGAGCCA CATGGACTCT GCGCTCATTC 

8951 CCCTCGAGTC AAATACGTAG TCGTTGCAAG TCCGCACCAG GTACTGGTAT 
GGGAGCTCAG TTTATGCATC AGCAACGTTC AGGCGTGGTC CATGACCATA 

9001 CCCACCAAAA AGTGCGGCGG CGGCTGGCGG TAGAGGGGCC AGCGTAGGGT 
GGGTGGTTTT TCACGCCGCC GCCGACCGCC ATCTCCCCGG TCGCATCCCA 

9051 GGCCGGGGCT CCGGGGGCGA GATCTTCCAA CATAAGGCGA TGATATCCGT 
CCGGCCCCGA GGCCCCCGCT CTAGAAGGTT GTATTCCGCT ACTATAGGCA 

9101 AGATGTACCT GGACATCCAG GTGATGCCGG CGGCGGTGGT GGAGGCGCGC 
TCTACATGGA CCTGTAGGTC CACTACGGCC GCCGCCACCA CCTCCGCGCG 

9151 G6AAAGTCGC GGACGCGGTT CCAGATGTTG CGCAGCGGCA AAAAGTGCTC 
CCTTTCAGCG CCTGCGCCAA GGTCTACAAC GCGTCGCCGT TTTTCACGAG 

9201 CATGGTCGGG AC6CTCTGGC CGGTCAGGCG CGCGCAATCG TTGACGCTCT 
GTACCAGCCC TGCGAGACCG GCCAGTCCGC GCGC6TTAGC AACTGCGAGA 

9251 AGACCGTGCA AAAGGAGAGC CTGTAA6CGG GCACTCTTCC GTGGTCTGGT 
TCTGGCACGT TTTCCTCTCG GACATTCGCC CGTGAGAAGG CACCAGACCA 

9301 GGATAAATTC GCAAGGGTAT CATGGCGGAC GACCGGGGTT CGAGCCCC6T 
CCTATTTAAG CGTTCCCATA GTACCGCCTG CTGGCCCCAA GCTC6GGGCA 

9351 ATCCGGCCGT CCGCCGTGAT CCATGCGGTT ACCGCCCGCG TGTCGAACCC 
TAGGCCGGCA GGCGGCACTA GGTACGCCAA TGGCGGGCGC ACAGCTTGGG 
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9401 AGGTGTGCGA C^^AGACAA CGGGGGAGTG CTCCTTTTGG CTTCCT^^A 
TCCACACGCT g3RgTCTGTT GCCCCCTCAC GAGGAAAACC GAAGGASC?3T 

9451 GGCGCGGCGG CTGCTGCGCT AGCTTTTTTG GCCACTGGCC GCGCGCAGCG 
CC6CGCC6CC 6ACGAC6CGA TCGAAAAAAC CG6TGACCGG CGCGC6TCGC 

9501 TAAGCGGTTA GGCTGGAAAG CGAAAGCATT AA6TGGCTCG CTCCCTGTAG 
ATTCGCCAAT CCGACCTTTC 6CTTTCGTAA TTCACCGAGC GAGGGACATC 

9551 CCGGAGGGTT ArTTTCCAAG GGTTGAGTCG CGGGACCCCC GGTTCGAGTC 
GGCCTCCCAA TAAAAGGTTC CCAACTCAGC GCCCTGGGGG CCAAGCTCAG 

9601 TCGGACCGGC CGGACTGCGG CGAACGGGGG TTTGCCTCCC CGTCATGCAA . 
AGCCTGGCCG GCCTGACGCC . 6CTTGCCCCC AAACGGAGG6 GCAGTACGTT 

9651 GACCCCGCTT GCAAATTCCT CCGGAAACAG GGACGAGCCC CTTTTTTGCT 
CTGGGGCGAA CGTTTAAGGA GGCCTTTGTC CCTGCTCGGG GAAAAAACGA 

9701 TTTCCCAGAT GCATCCGGTG CTGCGGCAGA TGCGCCCCCC TCCTCAGCAG 
AAAGGGTCTA CGTA6GCCAC GACGCCGTCT ACGCGGGGGG AGGAGTCGTC 

9751 CGGCAAGAGC AAGAGCAGCG GCAGACATGC AGGGCACCCT CCCCTCCTCC 
GCCGTTCTCG TTCTCGTCGC CGTCTGTACG TCCCGTGGGA GGGGAGGAGG 

9801 TACCGCGTCA GGAGGGGCGA CATCCGCGCT TGACGCGGCA GCAGATGGTG 
ATGGCGCAGT CCTCCCCGCT GTAGGCGCCA ACTGCGCCGT CGTCTACCAC 

9851 ATTACGAACC CCCGCGGCGC CGGGCCCGGC ACTACCTGGA CTT6GAGGAG 
TAATGCTTGG 6GGCGCCGCG GCCCGGGCCG TGATGGACCT GAACCTCCTC 

9901 GGCGAGGGCC TGGCGCGGCT AGGAGCGCCC TCTCCTGAGC GGCACCCAAG 
CCGCTCCCGG ACCGCGCCGA TCCTCGCGGG AGAGGACTCG CCGTGGGTTC 

9951 GGTGCAGCTG AAGCGT6ATA CGCGTGAGGC GTACGTGCCG CGGCAGAACC 
CCACGTCGAC TTCGCACTAT GCGCACTCCG CATGCACGGC GCCGTCTTGG 

10001 TGTTTCGCGA CCGCGAGGGA GAGGAGCCCG AGGAGATGCG GGATCGAAAG 
ACAAAGCGCT GGCGCTCCCT CTCCTCGGGC TCCTCTACGC CCTAGCTTTC 

10051 TTCCACGCAG GGCGCGAGCT GCGGCATGGC CTGAATCGCG AGCGGTTGCT 
AAGGTGCGTC CCGCGCTCGA CGCCGTACCG GACTTAGCGC TCGCCAACGA 

10101 GCGCGAGGAG GACTTTGAGC CCGACGCGC6 AACCGGGATT AGTCCCGCGC 
CGCGCTCCTC CTGAAACTCG GGCTGCGCGC TTGGCCCTAA TCAGG6CGCG 

10151 GCGCACACGT GGCGGCCGCC GACCTGGTAA CCGCATACGA GCAGACGGTG 
CGCGTGTGCA CCGCCGGCGG CT6GACCATT. GGCGTATGCT CGTCT6CCAC 

10201 AACCAGGAGA TTAACTTTCA AAAAAGCTTT AACAACCACG TGCGTACGCT 
TTGGTCCTCT AATTGAAAGT TTTTTCGAAA TTGTTGGTGC ACGCATGCGA 

10251 TGTGGCGCGC GAGGAGGTGG CTATAGGACT GATGCATCTG TGGGACTTTG 
ACACCGCGCG CTCCTCCACC GATATCCTGA CTACGTAGAC ACCCTGAAAC 

10301 TAAGCGCGCT GGAGCAAAAC CCAAATAGCA AGCCGCTCAT GGCGCAGCTG 
ATTCGCGCGA CCTCGTTTTG GGTTTATCGT TCGGCGAGTA CCGCGTCGAC 




106/144 



wo 02/22080 



PCT/USOl/28861 



10351 TTCCTTATAG TlifcGCACAG CAGGGACAAC GAGGCATTCA GGGATGiMrr 
AAGGAATATC AOTICGTGTC GTCCCTGTTG CTCCGTAAGT CCCTACGiCA 

10401 GCTAAACATA GTAGAGCCCG AGGGCCGCTG GCTGCTCGAT TTGATAAACA 
CGATTTGTAT CATCTCGGGC TCCCGGCGAC CGACGAGCTA aactatttgt 

10451 TCCTGCAGAG CATAGTGGTG CAGGAGCGCA 6CTTGAGCCT GGCTGACAAG 

AGGACGTCTC GTATCACCAC gtcctcgcgt cgaactcgga ccgactgttc 

10501 GTGGCCGCCA TCAACTATTC CATGCTTAGC CTGGGCAAGT TTTACGCCCG 
CACCGGCGGT AGTTGATAAG GTACGAATCG GACCCGTTCA AAATGCGGGC 

10551 CAAGATATAC CATACCCCTT ACGTTCCCAT AGACAAGGAG GTAAAGATCG 
GTTCTATATG GTATGGGGAA TGCAAGGGTA TCTGTTCCTC CATTTCTAGC 

10601 AGGGGTTCTA CATGCGCATG GCGCTGAAGG TGCTTACCTT GAGCGACGAC 
TCCCCAAGAT GTACCCGTAC CGCGACTTCC ACGAATGGAA CTCGCTGCTG 

10651 CTGGGCGTTT ATCGCAACGA GCGCATCCAC AAGGCCGTGA GCGTGAGCCG 
GACCCGCAAA TAGCGTTGCT CGCGTAGGTG TTCCGGCACT C6CACTCGGC 

10701 GCGGCGCGAG CTCAGCGACC GCGAGCTGAT GCACAGCCTG CAAAGGGCCC 
CGCCGCGCTC GAGTC6CTGG CGCTCGACTA CGTGTCGGAC GTTTCCCGGG 

10751 TGGCTGGCAC GGGCAGCGGC GATAGAGAGG CCGAGTCCTA CTTTGACGCG 
ACCGACCGTG CCCGTCGCCG CTATCTCTCC GGCTCAGGAT GAAACTGCGC 

10801 GGCGCTGACC TGCGCTGGGC CCCAAGCCGA CGCGCCCTGG AGGCAGCTGG 
CCGCGACTGG ACGCGACCCG GGGTTCGGCT 6CGCGGGACC TCCGTC6ACC 

10851 GGCCGGACCT GGGCTGGCGG TGGCACCCGC GCGCGCTGGC AACGTCGGCG 
CCGGCCTGGA CCCGACCGCC ACCGTGGGCG CGCGCGACCG TTGCAGCCGC 

10301 GCGTGGAGGA ATATGACGAG GACGATGAGT ACGAGCCAGA GGACGGCGAG 
CGCACCTCCT TATACTGCTC CTGCTACTCA TGCTCGGTCT CCTGCCGCTC 

10951 TACTAAGCGG TGATGTTTCT GATCAGATGA TGCAAGAC6C AACGGACCCG 
ATGATTCGCC ACTACAAAGA CTAGTCTACT ACGTTCTGCG TTGCCTGGGC 

11001 GCGGTGCGGG CGGCGCTGCA GAGCCAGCCG TCCGGCCTTA ACTCCACGGA 
CGCCACGCCC GCCGCGACGT CTCGGTCGGC AGGCCGGAAT TGAGGTGCCT 

11051 .CGACTGGCGC CAGGTCATGG ACCGCATCAT GTCGCTGACT GCGCGCAATC 
GCTGACCGCG GTCCAGTACC TGGCGTAGTA CAGCGACTGA CGCGCGTTA6 

11101 CTGACGCGTT CCGGCAGCAG CCGCAGGCCA ACCGGCTCTC CGCAATTCTG 
GACTGCGCAA 6QCCGTCGTC GGCGTCCGGT TG6CCGAGAG GCGTTAAGAC 

11151 GAA6CG6TGG TCCCGGCGCG CGCAAACCCC ACGCACGA6A AGGTGCTGGC 
CTTCGCCACC AGGGCCGCGC GCGTTTGGG6 TGCGTGCTCT TCCACGACCG 

112 01 GATCGTAAAC GCGCTGGCCG AAAACAGGGC CATCCGGCCC GACGAGGCCG 
CTA6CATTTG CGCGACC6GC TTTTGTCCC6 GTAGGCCGGG CTGCTCCGGC 

11251 GCCTGGTCTA CGACGCGCTG CTTCAGCGCG TGGCTCGTTA CAACAGCGGC 
CGGACCAGAT GCTGCGCGAC GAAGTCGCGC ACCGAGCAAT GTTGTCGCCG 
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11301 AACGTGCAGA C^nCTGGA CCGGCTGGTG GGGGATGTGC' QCGAGGp^T 
TTGCACGTCT gSItGGACCT GGCCGACCAC CCCCTACACG CGCTCCSBtA 

11-351 GGCGCAGCGT GAGCGCGCGC AGCAGCAGGG CAACCTGGGC TCCATGGTTG 
CCGCGTCGCA CTCGCGCGCG TCGTCGTCCC GTTGGACCCG AGGTACCAAC 

11401 CACTAAACGC CTTCCTGA6T ACACA6CCCG CCAACGTGCC GCGGGGACAG 
GTGATTTGCG GAAGGACTCA TGTGTCGGGC GGTT6CACGG CGCCCCTGTC 

11451 GAGGACTACA CCAACTTTGT GAGCGCACTG CGGCTAATGG TGACTGAGAC 
CTCCT6AT6T GGTTGAAACA CTCGCGTGAC GCCGATTACC ACTGACTCTG 

11501 ACCGCAAAGT GAGGTGTACC AGTCTGGGCC AGACTATTTT TTCCAGACCA 
TGGCGTTTCA CTCCACATGG TCAGACCCGG TCTGATAAAA AAGGTCTGGT 

11551 Gl'AGACAAGG CCTGCAGACC GTAAACCTGA GCCAGGCTTT CAAAAACTTG 
CATCTGTTCC GGACGTCTGG CATTTGGACT CCCTCCCAAA GTTTTTGAAC 

11601 CAGGGGCTGT OGGGGGTGCG GGCTCCCACA GGCGACCGCG CGACCGTGTC 
GTCCCCGACA CCCCCCACGC CCGAGGGTGT CCGCTGGCGC GCTGGCACAG 

11651 TAGCTTGCTG ACGCCCAACT CGCGCCTGTT GCTGCTGCTA ATAGCGCCCT 
ATCGAACGAC TGCGGGTTGA GCGCGGACAA CGACGACGAT TATCGCGGGA 

11701 TCACGGACAG TGGCAGCGTG TCCCGGGACA CATACCTAGG TCACTTGCTG 
AGTGCCTGTC ACCGTCGCAC AGGGCCCTGT GTATGGATCC A6TGAACGAC 

11751 ACACTGTACC GCGAGGCCAT AGGTCAGGCG CATGTGGACG AGCATACTTT 
TGTGACATGG CGCTCCGGTA TCCAGTCCGC GTACACCTGC TCGTATGAAA 

11801 CCAGGAGATT ACAAGTGTCA GCCGCGCGCT GGGGCAGGAG GACACGGGCA 
GGTCCTCTAA TGTTCACAGT CGGCGCGCGA CCCCGTCCTC CTGTGCCCGT 

11B51 GCCTGGAGGC TlACCCTAAAC TACCTGCTGA CCAACCGGCG GCAGAAGATC 
CGGACCTCCG TTGGGATTTG ATGGACGACT GGTTGGCCGC CGTCTTCTAG 

11901 CCCTCGTTGC ACAGTTTAAA CAGCGAGGAG GAGCGCATTT TGCGCTACGT 
GGGAGCAACG TGTCAAATTT GTCGCTCCTC CTCGCGTAAA ACGC6ATGCA 

11951 GCAGCAGAGC GTGAGCCTTA ACCTGATGCG CGACGGGGTA ACGCCCAGCG 
CGTCGTCTCG CACTC6GAAT TGGACTACGC GCTGCCCCAT TGCGGCTCGC 

12001 TGGCGCTGGA CATGACCGCG CGCAACATGG AACCGGGCAT GTATGCCTCA 
ACCGCGACCT GTACTGGCGC GCGTTGTACC TTGGCCCGTA CATACGGAGT 

12051 AACCGGCCGT TTATCAACCG CCTAATGGAC TACTTGCATC GCGCGGCCGC 
TTGGCCGGCA AATAGTTGGC GGATTACCT6 ATGAACGTAG CGCGCCGGCG 

12101 C6TGAACCCC GAGTATTTCA CCAATGCCAT CTTGAACCCG CACTGGCTAC 
GCACTTGGGG CTCATAAA6T GGTTACGGTA GAACTTGGGC GTGACCGATG 

12151 CGCCCCCTGG TTTCTACACC GGG0GATTC6 AGGTGCCCGA QGGTAACGAT 
GCGGGGGACC AAAGAT6TGG CCCCCTAA6C TCCACGGGCT CCCATTGCTA 

12201 CGATTCCTCT CG6ACGACAT A6ACGACA6C GTGTTTTCCC CGCAACCGCA 
CCTAAGGAGA CCCTGCTGTA TCTGCTGTCG CACAAAAGGG GCGTTGGCGT 
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12251 GACCCT6C7A GWTGCAAC AGCGCGAGCA GGCACAGGCCTITCGCTSMa 
CTGGGACGAT dWRACGTTG TCGCGCTCGT CCGTCTCCGC CGCGAC^T 

12301 AGGAAAGCTT CCGCAGGCCA AGCAGCTTGT CCGATCTAGG CGCTGCGGCC 
TCCTTTCGAA GGCGTCCGGT TCGTCGAACA GGCTAGA7CC GCGACGCCGG 

12351 CCGCGGTCAG ATGCTAGTAG CCCATTTCCA AGCTTGATAG GGTCTCTTAC 
GGCGCCAGTC TACGATCATC GGGTAAAGGT TCGAACTATC CCAGAGAATG 

12401 CAGCACTCGC ACX^lCCCGCC CGCGCCTGCT 6GGCGAGGAG GAGTACCTAA 
GTCGTGAGCG TGGTGGGCGG GCGCGGACGA CCCGCTCCTC CTCATGGATT 

12451 ACAACTCGCT GCTGCAGCCG CAGCGCGAAA AAAACCTGCC TCCGGCATTT 
TGTTGAGC6A CGACGTCGGC GTCGCGCTTT TTTTGGACGG AGGCCGTAAA 

12501 CCCAACAACG GGATAGAGAG CCTAGTGGAC AAGATGAGTA GATGGAAGAC 
GGGTTGTTGC CCTATCTCTC GGATCACCTG TTCTACTCAT CTACCTTCTG 

12551 GTACGCGCAG GAGCACAGGG ACGTGCCAGG CCCGCGCCCG CCCACCCGTC 
CATGCGCGTC CTCGTGTCCC TGCACGGTCC GGGCGCGGGC GGGTGGGCAG 

12601 GTCAAAGGCA CGACCGTCAG CGGGGTCTGG TGTGGGAGGA CGATGACTCG 
CAGTTTCCGT GCTGGCAGTC GCCCCAGACC ACACCCTCCT GCTACTGAGC 

12651 GCAGACGACA GCAGCGTCCT GGATTTGGGA GGGAGTGGCA ACCCGTTTGC 
CGTCTGCTGT CGTCGCAGGA CCTAAACCCT CCCTCACCGT TGGGCAAACG 

12701 GCACCTTCGC CCXIAGGCTGG GGAGAATGTT TTAAAAAAAA AAAAAGCATG 
CGTGGAA6CG GGGTCCGACC CCTCTTACAA AATTTTTTTT TTTTTCGTAC 

12751 ATGCAAAATA AAAAACTCAC CAAGGCCATG GCACCGAGCG TTGGTTTTCT 
TACGTTTTAT TrTTTGAGTG GTTCCGGTAC CGTGGCTCGC AACCAAAAGA 

12BD1 TGTATTCCCC TTAGTATGCG GCGCGCGGCG ATGTATGAGG AAGGTCCTCC 
ACATAAGGGG AATCATACGC CGCGC6CCGC TACATACTCC TTCCAGGAGG 

12B51 TCCCTCCTAC GAGAGTGT6G TGAGC6CGGC GCCAGTGGCG GCGGCGCTGG 
AGGGAGGATG CTCTCACACC ACTCGCGCCG CGGTCACCGC CGCCGCGACC 

12901 GTTCTCCCTT CGATGCTCCC CTGGACCCGC CGTTTGTGCC TCCGCGGTAC 
CAAGAGGGAA GCTACGAGGG GACCT6GGC6 GCAAACAC6G AGGCGCCATG 

12951 CTGCGGCCTA CCGGGGGGAG AAACAGCATC CGTTACTCTG AGTTGGCACC 
GACGCCGGAT GGCCCCCCTC TTTGTC6TAG GCAATGAGAC TCAACCGTGG 

13001 CCTATTCGAC ACCACCCGTG TGTACCTGGT GGACAACAAG TCAACGGATG 
GGATAAGCTG TGGTGGGCAC ACATGGACCA CCTGTTGTTC AGTTGCCTAC 

13051 TGGCATCCCT GAACTACCAG AACGACCACA GCAACTTTCT GACCACGGTC 
ACCGTAGGGA CTTGATGGTC TTCCTC6TGT CGTTGAAAGA CTGGT6CCAG 

13101 ATTCAAAACA ATGACTACAG CCGGGGGGAG 6CAAGCACAC AGACCATCAA. 

TAAGTTTTGT TACTGAT6TC GGGCCCCCTC CGTTCGTGTG TCTGGTAGTT 

13151 TCTTGACGAC CGGTCGCACT GGGGCGGC6A CCTGAAAACC ATCCTGCATA 
AGAACTGCTG GCCAGCGTGA CCCCGCCGCT GCACTTTTGG TAGGACGTAT 
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13201 CCAACATGCC AMKGTGAAC GAGTTCATGT TTACC AATAA '^lITAfiatKG 
GGTTGTACGG TiPTCACTTG CTCAAGTACA AATGCTTATT CAAATTa^C 

13251 CGGGTGATGG TGTCGCGCTT GCCTACTAAG GACAATCAGG TGGAGCTGAA 
GCCCACTACC ACAGCGCGAA CGGATGATTC CtGTTAGTCC ACCTCGACTT 

13301 ATACGAGTGG GTGGAGTTCA CGCTGCCCGA GGGCAACTAC TCCGAGACCA 
TATGCTCACC CACCTCAA6T GCGACGGGCT CCCGTTGATG AGGCTCTGGT 

13351 TGACCATAGA CCTTATGAAC AACGCGATCG TGGAGCACTA CTTGAAAGTG 
ACTGGTATCT GGAATACTTG TTGCQCTAGC ACCTCGTGAT GAACTTTCAC 

13401 GGCAGACAGA ACG6GGTTCT GGAAAGCGAC ATC6GGGTAA AGTTTGACAC 
CCGTCTGTCT TGCCCCAAGA CCTTTCGCTG TAGCCCCATT TCAAACTGTG 

13451 CCGCAACTTC AGACTGGGGT TTGACCCCGT CACTGGTCTT GTCATGCCTG 
GGCGTTGAAG TCTGACCCCA AACTGGGGCA GTGACCAGAA CAGTACGGAC 

13501 GGGTATATAC AAACGAAGCC TTCCATCCAG ACATCATTTT GCTGCCAGGA 
CCCATATATG TTTGCTTCGG AAGGTAGGTC TGTAGTAAAA CGACGGTCCT 

13551 TGCGGGGTGG ACTTCACCCA CAGCCGCCTG AGCAACTTGT TGGGCATCCG 
ACGCCCCACC TGAAGTGGGT GTCGGCGGAC TCGTTGiAACA ACCCGTAGGC 

13601 CAAGCGGCAA CCCTTCCAGG AGGGCTTTAG GATCACCTAC GATGATCTGG 
GTTCGCCGTT GGGAAGGTCC TCCCGAAATC CTAGTGGATG CTACTAGACC 

13651 AGGGTGGTAA CATTCCCGCA CTGTTGGATG TGGACGCCTA CCAGGCGAGC 
TCCCACCATT GTAAGGGCGT GACAACCTAC ACCTGCGGAT GGTCCGCTCG 

13701 TTGAAAGATG ACACCGAACA GGGCGGGGGT GGCGCAGGCG GCAGCAACAG 
AACTTTCTAC TGTGGCTTGT CCCGCCCCCA CCGCGTCCGC CGTCGTTGTC 

137 51 CAGTGGCAGC GGCGCGGAA6 AGAACTCCAA CGCGGCAGCC GCGGCAATGC 
GTCACCGTCG CCGCGCCTTC TCTTGAGGTT GCGCCGTCGG CGCC6TTACG 

13801 AGCCGGTGGA G6ACATGAAC GATCATGCCA TTCGCGGCGA CACCTTTGCC 
TCGGCCACCT CCTGTACTTG CTAGTACGGT AAGCGCCGCT GTGGAAACGG 

13851 ACACGGGCTG AGGAGAAGCG CGCTGAGGCC GAAGCAGCGG CCGAAGCTGC 
TGTGCCCGAC TCCTCTTCGC GCGACTCCCG CTTCGTCGCC GGCTTCGACG 

13901 CGCCCCCGCT GCGCAACCCG AGGTCGAGAA GCCTCAGAAG AAACCGGTGA 
GCGGGGGCGA CGCGTTGGGC TCCAGCTCTT CGGAGTCTTC TTTGGCCACT 

13951 TCAAACCCCT GACAGAGGAC AGCAAGAAAC GCAGTTACAA CCTAATAAGC 
AGTTTGGGGA CTGTCTCCTG TCGTrCrXTG CGTCAATGTT GGATTATTCG 

14001 AATGACAGCA CCTTCACCCA CTACCGCAGC TGGTACCTT6 CATACAACTA 
TTACTGTCGT GGAAGTGGGT CATGGCGTCG ACCATGGAAC GTATGTTGAT 

14051 CGGCGACCCT CAGACCGGAA TCCGCTCATG GACCCTGCTT TCCACTCCTG 
GCCGCTGGGA GTCTGGCCTT AGGCGAGTAC CTG6GACGAA ACGTGAGGAC 

14101 ACGTAACCTG CGGCTCGGAG CAGGTCTACT GGTCGTTGCC AGACATGATG 
TGCATTGGAC GCCGAGCCTC GTCCAGATGA CCAGCAACGG TCTGTACTAC 
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14151 CAAGACCCCG IH^B^'TTCCG TCTCCACOCGC CAGATCAGCA ACTTPCPMT 
GTTCTGGGGC ACfTCGAAGGC GAGGTOCGCG GTCTAGTC6T TGAAAGdWA 

14201 GGTGG6CGCC GAGCTGTTGC CCGTGCACTC CAAGAGCTTC TACAACGACC 
CCACCCGCGG CTC6ACAACG GGCACGTGA6 GTTCTCGAAG ATGTT6CTGG 

14251 AGGCCGTCTA CTTCCCAACTC ATCCGCCA6T TTACCTCTCT 6ACCCAC6TG 
TCCGGCAGAT GAGGGTTGAC TAGGCCCTCA AATGGAGAGA CTGGGTGCAC 

14301 TTCAATCGCT TTCCCGAGAA CCAGATTTTO GC6CGCCCGC CAGCCCCCAC 
AAGTTAGCGA AAGGGCTCTT GGTCTAAAAC C6C6C6G6CG GTCGGGGGTG 

14351 CATCACCACC CTCAGTGAAA AC6TTCCTGC TCTCACAGAT CACGGGACGC 
GTAGTGGTGG CAGTCACTTT TGCAAGGACG AGAGTGTCTA 6TGCCCT6CG 

14401 TACCGCTGCG CAACAGCATC GGAGGAGTCC ACCCACTGAC CATTACTGAC 
ATGGCGACGC GTTGTCGTAG CCTCCTCAQQ TCGCTCACTG OTAATGACTG 

14451 GCCAGACGCC GCACCTGCCC CTACGTTTAC AAGGCCCTGG GCATAGTCTC 
CGGTCTGCGG CGTGGACGGG GATGCAAATG TTCCGGGACC CGTATCAGAG 

14501 GCCGCGCGTC CTATCGAGCC GCACTTTTTG AGCAAGCATG TCCATCCTTA 
C5GCGCGCAG GATAGCTCGG CGTGAAAAAC TCGTTCGTAC AGGTAGGAAT 

14551 TATCGCCCAG CAATAACACA GGCTGGGGCC TGCGCTTCCC AAGCAAGATG 
ATAGCGGGTC GTTATTGTGT CCGACCCCGG ACGCGAAGGG TTCGTTCTAC 

14601 TTTGGCGGGG CCAAGAAGCG CTCCGACCAA CACCCAGTGC OCGTGCGCGG 
AikACCGCCCC GGTTCTTCGC GAGGCTGGTT GTGGGTCACG CGCACGCGCC 

14651 GCACTACCGC GCGCCCTGGG GCGCGCACAA ACGCGGCCGC ACTGGGCGCA 
CGTGATGGCG CGCGGGACCC CGCGCGTGTT TGCGCCGGCG TGACCCCCGT 

14701 CCACCGTCGA TGACGCCATC GACGCGGTOG TGGAGGAGGC GCGCAACTAC 
GGTGGCAGCT ACTGCGGTAG CTGCGCCACC ACCTCCTCCG CGCGTTGATG 

14751 ACGCCCACGC CGCCACCAGT GTCCACAGTG GACGCGGCCA TTCAGACCGT 
TGCGGGTGCG GCGGTGGTCA CAGGTGTCAC CTGCGCCGGT AAGTCTGGCA 

14B01 6GTGCGCGGA OCCCGGCGCT ATGCTAAAAT 6AAGAGACGG CGGAGGCGCG 
CCACGCGCCT CGGGCCGCGA TACGATTXTA CTTCTCTGCC GCCTCCGCGC 

14851 TAGCACGTC6 CCACCGCCGC CGACCCGGCA CTGCCGCCCA ACGCGCGGCG 
ATCGTGCAGC GGTGGCGGCG GCTGG6CCGT GACGGCGG6T TGCGC6CCGC 

14901 GCGGCCCTGC TTAACCGCGC ACGTC6CACC GGCCGAC6G6 CGGCCATGCG 
CGCCGGGACG AATTGGCGCG TGCAGCGTGG CCGGCTGCCC GCCGGTACGC 

14951 CGCCGCTCGA AGGCTGGCC6 CGGGTATTCT CACTGTGCCC CCCAGGTCCA 
CCGGCGAGCT TCCGACC60C GCCCATAACA GTGACACGGG GGGTCCAGGT 

15001 GGCGACGAGC GGCCGCCGCA GCAGCCGCGG CCATTAGTGC TATGACTCAG 
CCGCTGCTCG CCGGCGGCGT CGTCGGCGCC GGTAATCACG ATACTGAGTC 

15051 GGTCGCAGGG GCAACGTGTA TTGGGTQCGC GACTCGGTTA GCGGCCTGCG 
CCAGCGTCCC CGTTGCACAT AACCCACX3C6 CTGAGCCAAT CGCCG6ACGC 
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15101 CGTGCCCGTG COiiCCCGCC CCCCGCGCAA CTAGATTGCA^GftAAASamr 
GCAC66GCAC G&SPSGGCGG GGG6CGCGTT GATCTAACGT TCTTil'JHiA 

15151 ACTTAGACTC 6TACTGTTGT ATGTATCCA6 CGGCG6CGGC GCGCAACGAA 
TGAATCTGAG CATGACAACA TACATAGGTC GCCGCCGCCG CGCGTTGCTT 

15201 GCTATGTCCA AOCGCAAAAT CAAA6AAGAG ATGCTCCAG6 TCATCGCGCC 
CGATACAGGT TCGCOTTTTA GTTTCTTCTC TACGAGGTCC AGTAGCGCGG 

15251 G6AGATCTAT GGCCCCCCGA AGAAGGAA6A GCAGGATTAC AAGCCCC6AA 
CCTCTAGATA CCXKS6G6GCT TCTTCCTTCT CGTCCTAATG TPCGGGGCTT 

15301 A6CTAAA6CG GGTCAAAAA6 AAAAAGAAAG ATGATGATGA TGAACTTGAC 
TCGATTTCGC CCAGTTTTTC TTTTTCTTTC TACTACTACT ACTTGAACTG 

15351 GACGAGGTGG AACTGCTGCA CGCrACCGCG CCCAGQCGAC GGGTACAGTG 
CTGCTCCACC TIGACGACGT GCGATGGCGC GGGTCCGCTG CCCATGTCAC 

15401 6AAAGGTCGA CGCGTAAAAC GTGTTTTGCG ACCCGGCACC ACCGTAGTCT 
CTTTCCAGCT GCGCATTTTG CACAAAACGC TGGGCCGTGG TGGCATCAGA 

15451 TTACGCCCGG TGA6CGCTCC ACCCGCACCT ACAAGCGCGT GTATGATGAG 
AATGCGGGCC ACTCGCGAGG TGGGCGTG6A TGTTCGCGCA CATACTACTC 

15501 GTGTACGGCG ACGAGGACCT GCTTGAGCAG GCCAACGAGC GCCTCGGGGA 
CACATGCCGC TGCTCCTGGA CGAACTCGTC CGGTTGCTCG CGGAGCCCCT 

15551 GTTTGCCTAC GGAAAGCGGC ATAAGGACAT GCTGGCGTTG CCGCTGGACG 
CAAACGGAT6 OCTTTCCCCG TATTCCTGTA CGACCGCAAC GGCGACCTGC 

15601 AGGGCAACCC AACACCTAGC CTAAAGCCCG TAACACTGCA 6CAGGTGCTG 
TCCCGTTGGG TTGTGGATCG GATTTCGGGC ATTGTGACGT CGTCCACGAC 

15651 CCCGCGCTTG CACCGTCCGA AGAAAAGCGC G6CCTAAAGC 6CGAGTCTGG 
GGGCGCGAAC GT6GCAG6CT TCTTTTCGCG CC6GATTTCG CGCTCAGACC 

15701 TGACTTGGCA CCCACCGTGC AGCTGATCGT ACCCAAGCGC CAGCGACTGG 
ACTGAACCGT GGGTGGCACG TCGACTACCA TCGGTTCGCG GTCGCT6ACC 

15751 AAGATGTCTT GGAAAUUITG ACCGTGGAAC CTGG6CT6GA GCCC6AG6TC 
TTCTACAGAA CCTTTTTTAC TCCCACCTTG CACCCGACCT CBGGCTCCAG 

15801 CGCGTGCGGC CAATCAAGCA GGTGGC6CC6 GGACT66GCG TGCAGACCGT 
GCGCACGCCG GTTAGTTCGT CCACCGCGGC' CCTGACCCGC ACGTCTGGCA 

15B51 GGACGTTCA6 ATACCCACTA CCAGTAGCAC CAGTATTGCC ACXGCCACAG 
CCTGCAAGTC TATGGGT6AT GGTCATC6TG GTCATAACGG TGGC66TGTC 

15901 AGGGCATGGA GACACAAACG OWCCGCTTG CCTCAGCGOT .GGCGCSATGCC 
TCCCGTACCT CT6TGTTTGC AGGGGCCAAC GQAGTCGCCA CCGCCTACGG 

15951 GCGGTGCAGG CGGTCGCTGC GGCCGCGTCC AAGACCTCTA CGGA6GTGGA 
CGCCACGTCC GCCAGCGACG CCGGCGCAGG -TTCTGGAGAT GCCTCCACGT 

16001 AACGGACCCG TGGATGTTTC GCGTTTCAGC CCCCCGGCGC CCGCGCCGTT 
TTGCCTGGGC ACCTACAAAG C6CAAA6TCG GGGGGCCGCG GGCGCGGCAA 
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16051 CGAGGAAGTA CiSKfeCCGCC AGCGCGCTAC TGCCCGAATA TGCCCTflBTT 
GCTC.CTTCAT GCCGCGGCGG TCGCGCGATG ACGGGCTTAT ACGGGATGTA 

16101 CCTTCCATTG CGCCTACCCC CGGCTATCGT GGCTACACCT ACCGCCCCAG 
GGAAGGTAAC GCGGATGGGG GCCGATAGCA CCGATGTGGA TGGCGGGGTC 

16151 AAGACGAGCA ACTACCCGAC GCCGAACCAC CACTGGAACC CGCCGCCGCC 
TTCTGCTCGT TGATGGGCTG CGGCTTGGTG GTGACCTTG6 OCQGCQGCGG 

16201 GTCGCCGTCG CCAGCCCGTG CTGGCCCCGA TTTCCGTGCG CAGGGTGGCT 
CAGCGGCAGC GGTCGGGCAC GACCGGGGCT AAAGGCACGC GTCCCACCGA 

16251 CGCGAAGGAG GCAGGACCCT GGTGCTGCCA ACAGCGCGCT ACCACCCCAG 
GCGCTTCCTC CGTCCTGGGA CCACGACGGT TGTCGCGCGA TGGTGGGGTC 

16301 CATCGTTTAA AAGCCGGTCT TTGTGGTTCT TGCAGATATG GCCCTCACCT 
GTAGCAAATT TTCGGCCA6A AACACCAAGA ACGTCTATAC CGQGAGTGGA 

16353 GCCGCCTCCG TTTCCCGGTG CCGGGATTCC GAGGAAGAAT GCACCGTAGG 
CGGCGGAGGC AAAGGGCCAC GGCCCTAAGG CTCCTTCTTA CGTGGCATCC 

16401 AGGGGCATGG CCGGCCACGG CCTGACGGGC GGCATGCGTC GTGCGCACCA 
TCCCCGTACC GGCCGGTGCC GGACTGCCCG CCGTACGCAG CACGCGTGGT 

16451 CCGGCGGCGG CGCGCGTCGC ACCGTCGCAT GCGCGGCGGT ATCCTGCCCC 
GGCCGCCGCC GCGCGCAGCG TGGCAGCGTA CGCGCCGCCA TAGGACGGGG 

16501 TCCTTATTCC ACTGATCGCC GCGGCGATTG GCGCCGTGCC CGGAATTGCA 
AGGAATAAGG TGACTAGCGG CGCCGCTAAC CGCGGCACGG GCCTTAACGT 

16551 TCCGTGGCCT TGCAGGCGCA GAGACACTGA TTAAAAACAA GTTGCATGTG 
AGGCACCGGA ACGTCCGCGT CTCTGTGACT AATTTTTGTT CAACGTACAC 

16601 GAAAAATCAA AATAAAAAGT CTGGACTCTC ACGCTCGCTT GGTCCTGTAA 
CTTTTTAGTT TTATTTTTCA GACCTGAGAG TGCGAGCGAA CCAGGACATT 

16651 CTATTTTGTA GAATGGAAGA CATCAACTTT GCGTCTCTGG CCCCGCGACA 
GATAAAACAT CTTACCTTCT GTAGTTGAAA CGCAGAGACC GGGGCGCTGT 

15701 CGGCTCGCCC CC6TTCATGG GAAACTGGCA AGATATCGGC ACCAGCAATA 
GCCGAGCGCG GGCAAGTACC CTTTGACCGT TCTATAGCCG TGGTCGTTAT 

16751 TGAGCGGTGG CGCCTTCAGC TGGGGCTCGC TGTGGAGCGG CATTAAAAAT 
ACTCGCCACC GCGGAAGTCG ACCCCGAGCG ACACCTCGCC GTAATTTTTA 

16801 TTCG6TTCCA CCGTTAAGAA CTATGGCAGC AAGGCCTGGA ACAGCAGCAC 
AAGCCAAGGT GGCAATTCTT GATACCGTCG TTCCGGACCT TGTCGTCGTG 

16851 AGGCCAGATG CTGAGG6ATA AGTTGAAAGA GCAAAATTTC CAACAAAAGG 
TCCGGTCTAC GACTCCCTAT TCAACTTTCT CGTTTTAAAG GTTGTTTTCC 

16901 TGGTAGATGG CCTGGCCTCT GGCATTAGCG GGGTGGTGGA CCTGGCCAAC 
ACCATCTACC GGACCG6AGA CCGTAATCGC CCCACCACCT GGACCGGTTG 

16951 CAGGCAGTGC AAAATAAGAT TAACAGTAAG CTTGATCCCC GCCCTCCCGT 
GTCCGTCACG TTTTATTCTA ATTGTCATTC GAACTAGG6G CGGGAGGGCA 
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17001 AGAGGAGCCT ClM^GGCCG' TGGAdACAGT 6TCTCCAGAG 'GGSCGO^HC 

TCTCCTCGGA qSIBgccggc ACCTCTGTCA cagaggtctc cccgcaCWc 



TTTTCGCAGG CGCGGGGCTG TCCCTTCTTT GAGACCACTG CGTTTATCTG 

17101 GAGCCTCCCT 0GTACGA6GA GGCACTAAAG CAAQQCCTGC CCACCACCCG 
CTCGGAGGGA GCATGCTCCT CCGTCATTTC GTTCCGGACG GGTGGTGGGC 

17151 TCCCATCGCG CCCATGGCTA CCGGAGTGCT GGGCCAGCAC ACACCCGTAA 
AGGGTAGCGC GGGTACCGAT GGCCTCAC6A CCCGGTCGTG TGTGGGCATT 

17201 CGCTGGACCT GCCTCCCCCC GCCGACACCC AGCAGAAACC TGTGCTGCCA 
GCGACCTGGA CGGAGGGGGG CGGCTGTGGG TCGTCTTTGG ACACGACGGT 

17251 CCCCCGACCG CXrGTTGTTGT AACCCGTCCT AGCCGCGCGT CCCTGCGCCG 
CCGGGCT66C GGCAACAACA TTGGGCAGGA TCGGCGCGCA GGGACCCGGC 

17301 CGCCGCCAGC GGTCCGCGAT CGTTGCGGCC CGTAGCCAGT GGCAACTGGC 
GCGGCGGTCG CCAGGCGCTA GCAACGCCGG GCATCGGTCA CCGTTGACCG 

17351 AAAGCACACT GAACAGCATC GTGGGTCTGG GGGTGCAATC CCTGAAGCGC 
TTTCGTGTGA CTTGTCGTAG CACCCAGACC CCCACGTTAG GGACTTCGCG 

17401 CGACGATGCT TCTGATAGCT AACGTGTCGT ATGTGTGTCA TGTATGCGTC 
GCTGCTACGA AGACTATCGA TTGCACAGCA TACACACA6T ACATACGCAG 

17451 CATGTCGCCG CCA6AGGAGC TGCTGAGCCG CCGCGCGCCC GCTTTCCAAG 
GTACAGCGGC GGTCTCCTCG ACGACTCGGC GGCGCGCGGG CGAAAGGTTC 

17 501 ATGGCTACCC CTTCGATGAT GCCGCAGTGG TCTTACATGC ACATCTCGGG 
TACCtSATGCG GAAGCTACTA CGCCGTCACC AGAATGTACG TGTAGAGCCC 

17551 CCAGGACGCC TCGGAGTACC TGAGCCCCGG GCTGGTGCAG TTTGCCCGCG 
GGTCCTGCGG AGCCTCATGG ACTCGGGGCC CGACCACGTC AAACGGGCGC 

17601 CCACCGAGAC GTACTTCAGC CTGAATAACA AGTTTAGAAA CCCCACGGTG 
GGTGGCTCTG CATGAAGTCG GACTTATTGT TCAAATCTTT GGGGTGCCAC 

17651 GCCCCTACCC ACGACGTCAC CACAGACCCG TCCCAGCGTT TGACGCTGCG 
CGCGGATGCG TOCTGCACTG GTGTCTGGCC AGGGTCGCAA ACTCCGACGC 

17701 GTTCATCCCT GTGGACCGTG AGGATACTGC GTACTCGTAC AAGGCGCGGT 
CAA6TAGGGA CACCTGGCAC TCCTA7GACG CATGAGCATG TTCCGCGCCA 

17751 TCACCCTAGC TGTGGGTGAT AACCGTGTGC TGGACATGGC TTCCACGTAC 
AGTGGGATCG ACACCCACTA TTGGCACACG ACCTGTACCG AAGGTGCATG 

17801 TTTGACATCC GCGGCGTGCT GQACA0GQ9C CCTACTTTTA AGCCCTACTC 
AAACTGTAG6 CGCCGCAC6A CCTGTCCCCG GGATGAAAAT TCGGGATGAG 

17B51 TGGCACTGCC TACAACGCCC TGGCTCCCAA GGGTGCCCCA AATCCTTGCG 
ACCGTGACGG ATGTTGCGGG ACCGAGGGTT CCCACGGGGT TTAGGAACGC 

17901 AATGGGATGA AGCTGCTACT GCTCTTGAAA TAAACCTAGA AGAAGAGGAC 
TTACCCTACT TC6ACGATGA CGAGAACrTT ATTTGGATCT ICTTCTCCTG 



17051 



AAAA6CGTCC 



GCGCCCCGAC AGGGAAGAAA CTCTGGTGAC 



6CAAATAGAC 
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17951 GATGACAAC6 AJVCGAAGT A6AC6AGCAA GCTGA0CA6C MUMA^^h 
CTACTGTTGC TTCTGCTTCA TCT6CTCGTT CGACTCGTCG TTTTTTGAGT 

18001 C6TATTTGG6 CAGGCGCCTT ATTCTGGTAT AAATATTACA AAGGAGGGTA 
GCATAAACCC GTCCGCGGAA TAAGACCATA TTTATAATGT TTCCTCCCAT 

18051 TTCAAATAGG TGTCGAAGGT CAAACACCTA AATATGCCGA TAAAACATTT 
AAGTTTATCC ACAGCTTCCA GTTTCTGGAT TTATACGGCT ATTTTGTAAA 

3 8101 CAACCTGAAC CTCAAATAGG AGAATCTCAG TGGTACGAAA CAGAAATTAA 
GTTGGACTTG GAGTTTATCC TCTTAGAGTC ACCATGCTTT GTCTTTAATT 

18151 TCATGCAGCT G6GAGA6TCC TAAAAAAGAC TACCCCAATG AAACCATGTT 
AGTAC6TCGA CCCTCTCAGG ATTTTTTCTG ATGGGGTTAC TTTGGTACAA 

18201 ACGGTTCATA TGCAAAACCC ACAAATGAAA ATGGAGGGCA AGGCATTCTT 
TGCCAAGTAT ACGTTTTGGG TGTTTACTTT TACCTCCCGT TCCGTAAGAA 

18251 GTAAAGCAAC AAAATGGAAA GCTAGAAAGT CAAGTGGAAA TGCAATTTTT 
CATTTCGTTG TTTTACCTTT CGATCTTTCA GTTCACCTTT ACGTTAAAAA 

18301 CTCAACTACT GAGGCAGCCG CAGGCAATGG TGATAACTTG ACTCCTAAAG 
GAGTTGATGA CTCCGTCGGC GTCCGTTACC ACTATTGAAC T6AGGATTTC 

18351 TGGTATTGTA CAGTGAAGAT GTAGATATAG AAACCCCAGA CACTCATATT 
ACCATAACAT GTCACTTCTA CATCTATATC TTTGGGGTCT GTGAGTATAA 

18401 TCTTACATGC CCACTATTAA GGAAGGTAAC TCACGAGAAC TAATGGGCCA 
AGAATGTACG GGTGATAATT CCTTCCATTG AGTGCTCTTG ATTACCCGGT 

18451 ACAATCTATG CCCAACAGGC CTAATTACAT TGCTTTTAGG GACAATTTTA 
TGTTAGATAC GG6TTGTCCG GATTAATGTA ACGAAAATCC CTGTTAAAAT 

18501 TTGGTCTAAT GTATTACAAC AGCACGGGTA ATATGGGTGT TCTGGCGGGC 
AACCAGATTA CATAATGTTG TCGTGCCCAT TATACCCACA AGACCGCCCG 

16551 CAAGCATCGC AGTTGAATGC TGTTGTAGAT TTGCAAGACA GAAACACAGA 
GTTCGTAGCG TCAACTTACG ACAACATCTA AACGTTCTGT CTTTGTGTCT 

18501 GCTTTCATAC CAGCTTTTGC TTGATTCCAT TGGTGATAGA ACCAGGTACT 
CGAAAGTATG GTCGAAAAC6 AACTAAGGTA ACCACTATCT TG6TCCATGA 

18651 TTTCTATGTG GAATCAGGCT GTTGACAGCT ATGATCCA6A TGTTAGAATT 
AAAGATACAC CTTAGTCCGA CAACTGTCGA TACTAGGTCT ACAATCTTAA 

18701 ATTGAAAATC ATGGAACTGA AGATGAACTT CCAAATTACT GCTTTCCACT 
TAACTTTTAG TACCTTGACT TCTACTTGAA GGTTTAATGA CGAAAGGTGA 

18751 GGGAGGTGTG ATTAATACAG AGACTCTTAC CAAGGTAAAA CCTAAAACAG 
CCCTCCACAC TAATTATGTC TCTGAGAATG GTTCCATTTT GGATTTTGTC 

18801 6TCAGGAAAA.T6GATG6GAA AAAGATGCTA CAQAATTTTC A6ATAAAAAT 
CAGTCCTTTT ACCTACCCTT TTTCTACiSAT GTCTTAAAAG TCTATTTTTA 

18851 GAAATAAGAG TTGGAAATAA TTTTGCCATG GAAATCAATC TAAATGCCAA 
CTTTATTCTC AACCTTTATT AAAACGGTAC CTTTAGTTAG ATTTACGGTT 
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18902 CCTGTGGAGA J^l^TCCTGT ACTCCAACAT AGCGCT6TAT TTUCCCVA 
GGACACCTCT TTAAAGGACA TGAGGTTGTA TCGCGACATA AACG6GCTGT 

18951 AGCTAAAGTA CAGTCCTTCC AACGTAAAAA TTTCTGATAA CCCAAACACC 
TCGATTTCAT GTCAGGAAG6 TTGCATTTTT AAAGACTATT GGGTTTGTGG 

19001 TACGACTACA TGAACAAGCG AGTGGTGGCT CCCGGGCTAG TGGACTGCTA 
ATCCTGATCT ACTTCTTCGC TCACCACCGA GGGCCCGATC ACCTGACGAT 

19051 CATTAACCTT GGAGCACGCT GGTCCCTTGA CTATATGGAC AACGTCAACC 
GTAATTGGAA CCTCGTGCGA OCAGGGAACT GATATACCTG TTGCAGTTGG 

19101 CATTTAACCA CCACCGCAAT GCTGGCCTGC GCTACCGCTC AATGTTGCTG 
GTAAATTGGT GGTGGC6TTA CGACCGGACG CGAT6GCGAG TTACAACGAC 

19151 GGCAATGGTC GCTAOXSTGCC CTTCCACATC CAGGTGCCTC AGAAGTTCTT 
CCGTTACCA6 CGATACACGG GAAGGTGTAG GTCCACG6AG TCTTCAAGAA 

19201 TGCCATTAAA AACCTCCTTC TCCTGCCGGG CTCATACACC TACGAGTGGA 
ACGGTAATTT TTGGAGGAAG AGGACGGCCC GAGTATGTGG ATGCTCACCT 

19251 ACTTCAGGAA GGATGTTAAC ATGGTTCTGC AGAGCTCCCT AGGAAATGAC 
TGAAGTCCTT CCTACAATTG TACCAAGACG TCTCGAGGGA TCCTTTACTG 

19301 CTAAGGGTTG AC6GAGCCAG CATTAAGTTT 6ATAGCATTT GCCTTTACGC 
GATTCCCAAC T6CCTCGGTC GTAATTCAAA CTATCGTAAA CGGAAATGCG 

19351 CACCTTCTTC CCCATGGCCC ACAACACCGC CTCCACGCTT GAGGCCATGC 
GTGGAAGAAG GGGTACCGGG TGTTGTGGCG GAGGTGCGAA CTCCGGTACG 

19601 TTAGAAACGA CACCAACGAC CAGTCCTTTA ACGACTATCT CTCCGCCGCC 
AATCTTTGCT GTGGTTGCTG GTCAGGAAAT TGCTGATAGA GAGGCGGCGG 

19451 AACATGCTCT ACCCTATACC CGCCAACGCT ACCAACGTGC CCATATCCAT 
TTGTACGAGA TGGGATATGG GCGGTTGCGA TGGTTGCACG GGTATAGGTA 

19501 CCCCTCCCGC AACTGGGCGG CTTTCCGCGG CTGGGCCTTC ACGCGCCTTA 
GGGGAGGGCG 7TGACCCGCC GAAAGGCGCC GACCCGGAAG TGCGCGGAAT 

19551 AGACTAAGGA AACCCCATCA CTGGGCTCGG GCTACGACCC TTATTACACC 
TCTGATTCCT TTGGGGTAGT GACCCGAGCC CGATGCTGGG AATAATGTGG 

19601 TACTCTGGCT CTATACCCTA CCTAGATGGA ACCTTTTACC TCAACCACAC 
ATGAGACCGA GATATGGGAT GGATCTACCT TGGAAAATGG AGTTGGTGTG 

19651 CTTTAAGAAG GTGGCCATTA CCTTTGACTC TTCTGTCAGC TGGCCTGGCA 
6AAATTCTTC CACC66TAAT GGAAACT6AG AAGACAGTCG ACCGGACC6T 

19701 ATGACCGCCT GCTTACCCCC AACGAGTTTG AAATTAA6CG CTCAGTTGAC 
TACTGGCGGA CGAATGGGGG TTGCTCA7AC TTTAATTCGC GAGTCAACTG 

19751 GGGGA6GGTT ACAACGTTGC CCAGTGTAAC AT6ACCAAAG ACTGGTTCCT 
CCCCTCCCAA TGTTGCAACG GGTCACATTG TACTGGTTTC TGACCAAGGA 

19801 GGTACAAATG CTAGCTAACT ATAACATTGG CTACCAGG6C TTCTATATCC 
CCATGTTTAC GATCGATTGA TATOGTAACC GATGGTCCCG AAGATATAGG 




116/144 



wo 02/22080 



PCT/USOl/28861 



19851 CAGAGAGCTA Cj^ptSACCGC ATGTACTCCT TCTTTAGAAA CTTCCA(^pt 
GTCTCTCGAT GTTCCTGGCG TACATGAGGA AGAAATCTTT GAAGGTCGGG 

19901 ATGAGCCGTC AGGT6GTGGA TGATACTAAA TACAAGGACT ACCAACAGGT 
TACTCGGCAG TCCACCACCT ACTATGATTT ATGTTCCT6A TG6TTGTCCA 

19951 GGGCATCCTA CACCAACACA ACAACTCTGG ATTTGTTGGC TACCTTGCCC 
CCCGTAGGA7 GTGGTTG7GT TGTTGAGACC TAAACAACCG ATGGAACGGG 

20001 CCACCATGCG CGAAGGACAG GCCTACCCTG CTAACTTCCC CTATCCGCTT 
GGTGGTACGC GCTTCCTGTC CGGATGGGAC GATTGAAGGG GATAGGCGAA 

20051 ATAGGCAAGA CCGCAGTTGA CAGCATTACC CAGAAAAAGT TTCTTTGCGA 
TATCCGTTCT GQCGTCAACT GTCGTAAT6G GTCTTTTTCA AAGAAACGCT 

20101 TCGCACCCTT TGGC6CATCC CATTCTCCAG TAACTTTATG TCCATGGGCG 
AGCGTGGGAA ACCGCGTA6G 6TAAGAGGTC ATTGAAATAC AGGTACCCGC 

20151 CACTCACAGA CCTG6GCCAA AACCTTCTCT ACGCCAACTC CGCCCACGCG 
GTGAGTGTCT GGACCCGGTT TTGGAAGAGA TGCGGTTGA6 GCGGGTGCGC 

20201 CTAGACATGA CTTTTGAGGT GGATCCCATG GACGAGCCCA CCCTTCTTTA 
GATCTGTACT GAAAACTCCA CCTA6GGTAC CTGCTCGG6T GGGAAGAAAT 

20251 TGTTTTGTTT GAAGTCTTTG ACGTGGTCCG TGTGCACCAG CCGCACCGCG 
ACAAAACAAA CTTCAGAAAC TGCACCAGGC ACACGTGGTC GGCGTGGCGC 

20301 GCGTCATCGA AACCGTGTAC CTGCGCACGC CCTTCTC6GC CGGCAACGCC 
CQCAGTAGCT TTGGCACATG GACGCGTGCG GGAAGAGCCG GCCGTTGCGG 

4 

20351 ACAACATAAA GAAGCAAGCA ACATCAACAA CAGCTGCC6C CATGGGCTCC 
TGTTGTATTT CTTCGTTCGT TGTAGTTGTT GTCGACGGCG GTACCCGAGG 

20401 AGTGAGCAGG AACTGAAAGC CATTGTCAAA GATCTTGGTT GTGGGCCATA 
TCACTCGTCC TTGACTTTCG GTAACAGTTT CTAGAACCAA CACCCGGTAT 

20451 TTTTTTGGGC ACCTATGACA AGCGCTTTCC AGGCTTTGTT TCTCCACACA 
AAAAAACCCG TGGATACTGT TCGCGAAAGG TCCGAAACAA AGAGGTGTGT 

20501 AGCTCGCCTG CGCCATAGTC AATACGGCCG GTCGCGAGAC TGGGGGCGTA 
TCGAGCGGAC GCGGTATCAG TTATGCCGGC CAGCGCTCTG ACCCCCGCAT 

20551 CACTGGATGG CCTTTGCCTG GAACCCX3CAC. TCAAAAACAT GCTACCTCTT 
GTGACCTACC GGAAACGGAC CTTGGGCGT6 AGTTTTTGTA CGATGGAGAA 

20601 TGAGCCCTTT GGCTTTTCTG ACCAGCGACT CAAGCAGGTT TACCAGTTTG 
ACTCGGGAAA CC6AAAAGAC TG6TCGCTGA GTTCGTCCAA ATGGTCAAAC 

20651 AGTACGAGTC ACTCCTGCGC CGTAGCGCCA TTGCTTCTTC CCCCGACCGC 
TCATGCTCAG TGAGGACGCG GCATCGCGGT AACGAAGAAG GGGGCTGGCG 

20701 TGTATAACGC TGGAAAAGTC CACCCAAAGC GTACAGGG6C CCAACTCGGC 
ACATATTGCG ACCTTTTCAG GTGGGTTTCG CATGTCCCCG GGTTCAGCCG 

20751 CGCCTGTGGA CTATTCTGCT GCATGTTTCT CCACGCCTTT GCCAACTGGC 
GCGGACACC7 GATAAGACGA C6TACAAAGA GGTGCGGAAA CGGTTGACCG 
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20801 CCCAAACTCC d^GATCAC AACCCCACCA TGAACCTTAT TACCGG^UT 
GGGTTTGAGG'GTACCTAGTG TTGGGGTGGT ACTTGGAATA ATGGCCCCAT 

20851 CCCAACTCCA TGCTCAACAG TCCCCAGGTA CAGCCCACCC TGC6TCGCAA 
GGGTTGAGGT ACGAGTT6TC AGGGGTCCAT 6TCGGGTGGG ACGCAGCGTT 

20901 CCAGGAACAG CTCTACAGCT TCCTGGAGCG CCACTCGCCC TACTTCCGCA 
GGTCCTTGTC GAGATGTCGA AGGACCTCGC GGTGAGCGGG ATGAAGGCGT 

20951 GCCACAGTGC GCAGATTAGG AGCGCCACTT CTTTTTGTCA CTTGAAAAAC 
. CGGTGTCACG CGTCTAATCC TCGCGGTGAA GAAAAACAGT 6AACTTTTTG 

21001 ATGTAAAAAT AATGTACTAG AGACACTTTC AATAAAGGCA AATGCTTTTA 
TACATTTTTA TTACATGATC TCTGTGAAAG TTATTTCCGT TTACGAAAAT 

21051 TTTGTACACT CTCGGGTGAT TATTTACCCC CACCCTTGCC GTCTGCGCCG 
AAACATGTGA GAGCCCACTA ATAAATGGGG GTGGGAAC6G CAGACGCGGC 

21101 TTTAAAAATC AAAGGGGTTC TGCCGCGCAT CGCTATGCGC CACTGGCAGG 
AAATTTTTAG TTTCCCCAAG ACGGCGCGTA GCGATACGCG GTGACCGTCC 

21151 GACACGTTGC GATACTGGTG TTTAGTGCTC CACTTAAACT CAGGCACAAC 
CTGTGCAACG CTATGACCAC AAATCACGAG GTGAATTTGA GTCCGTGTTG 

21201 CATCCGCGGC AGCTCGGTGA AGTTTTCACT CCACAGGCTG CGCACCATCA 
GTAGGCGCCG TCGAGCCACT TCAAAAGTGA GGTGTCCGAC GCGTGGTAGT 

21251 CCAACGCGTT TAGCAGGTCG GGCGCCGATA TCTTGAAGTC GCAGTTGGGG 
GGTTGCGCAA ATCGTCCAGC CCGCGGCTAT AGAACTTCAG CGTCAACCCC 

21301 CCTCCGCCCT GCGCGCGCGA GTTGCGATAC ACAGGGTTGC AGCACTGGAA 
GGAGGCGGGA CGCGCGCGCT CAACGCTATG TGTCCCAACG TCGTGACCTT 

21351 CACTATCAGC GCCGGGTGGT GCACGCTGGC CAGCACGCTC TTGTCGGAGA 
GTGATAGTCG CGGCCCACCA CGTGCGACCG GTCGTGCGAG AACAGCCTCT 

21401 TCAGATCCGC GTCCAGGTCC TCCGCGTTGC TCAGGGCGAA CGGAGTCAAC 
AGTCTAGGCG CAOGTCCAGG AGGCGCAACG AGTCCCGCTT GCCTCAGTTG 

21451 TTTGGTAGCT GCCTTCCCAA AAAGGGCGCG TGCCCAGGCT TTGAGTTGCA 
AAACCATCGA C6GAAG6GTT TTTCCCGCGC ACGGGTCCGA AACTCAACGT 

21501 CTC6CACCGT AGTGGCATCA AAAGGTGACC GTGCCCGGTC TGGGCGTTAG 
GAGC6TGGCA TCACCGTAGT TTTCCACTGG CACGGGCCAG ACCCGCAATC 

21551 GATACAGCGC CTGCATAAAA GCCTTGATCT GCTTAAAAGC CACCTGAGCC 
CTATGTCGCG GACGTATTTT CGGAACTAGA CGAATTTTCG GTGGACTCGG 

21601 TTTGCGCCTT CAGAGAAGAA CATGCCGCAA GACTTGCCGG AAAACTGATT 
AAACGCGGAA GTCTCTTCTT GTACGGCGTT CTGAACGGCC TTTTGACTAA 

21651 GGCCGGACAG GCCGCGTCGT GCACCCAGCA CCTTGCGTCG GTGTTGGAGA 
CCGGCCTGTC CGGCGCAGCA CGT6CGTC6T GQAACGCAGC CACAACCTCT 

21*701 TCTGCACCAC ATTTCGGCCC CACCGGTTCT TCACGATCTT 6GCCTTGCTA 
AGACGTGGTG TAAAGCCGGG GTGGCCAAGA AG7GCTAGAA CCGGAAC6AT 
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2n51 GACTGCTCCT Tl^CGCGCG CTGCCCGTTT TCGCTCGTCA CA'i'UL'A'^pc 

CTGACGAGGA AGTCGCGCGC GACGGGCAAA AGCGAGCAGT GTAGGTAAAG 

21801 AATCACGTGC TCCTTATTTA TCATAATGCT TCCGTGTAGA CACTTAAGCT 
TTAGTGCACG AGGAATAAAT AGTATTACGA AG6CACATCT GTGAATTCGA 

21851 CGCCTTCGAT CTCAGCGCAQ CGGTGCAGCC ACAACGCGCA GCCCGTGGGC 
CCGGAA6CTA GAGTCGCGTC GCCACGTC6G TGTTGCGCGT CGGGCACCCG 

21901 TCGTGATGCT TGTAGGTCAC CTCTGCAAAC GACTGCAGGT ACGCCTGCAG 
AGCACTAC6A ACATCCAGTG GAGACGTTTG CTGACGTCCA TGCGGACGTC 

21951 GAATCGCCCC ATCATCGTCA CAAAGGTCTT GTTGCTGGTG AAGGTCAGCT 
CTTAGCGGGG TAGTAGCAGT GTTTCCAGAA CAACGACCAC TTCCAGTCGA 

22001 GCAACCCGCG GTGCTCCTCG TTCAGCCAGG TCTTGCATAC GGCCGCCAGA 
CGTTOGGCGC CACGAGGAGC AA6TCGGTCC AGAACGTATG CCGGCGGTCT 

22051 6CTTCCACTT GGTCAGGCAG TAGTTTGAAG TTCGCCTTTA 6ATCGTTATC 
CGAAGGT6AA CCAGTCCGTC ATCAAACTTC AAGCGGAAAT CTAGCAATAG 

22101 CACGTGGTAC TTGTCCATCA GCGCGCGCGC AGCCTCCATG CCCTTCTCCC 
GTGCACCATG AACAGGTAGT CGCGCGCGCG TCGGAG6TAC GGGAAGAGGG 

22151 ACGCAGACAC GATCGGCACA CTCAGCGGGT TCATCACCGT AATTTCACTT 
TGCGTCTGTG CTAGCCGTGT GAGTCGCCCA AGTAGTGGCA TTAAAGTGAA 

22201 TCCGCTTCGC TGGGCTCTTC CTCTTCCTCT TGCGTCCGCA TACCACGCGC 
AGGCGAAGCG ACCCGAGAAG GAGAAGGAGA ACGCAGGCGT ATGGTGCGCG 

22251 CACTGGGTCG TCTTCATTCA GCCGCCGCAC TGTGCGCTTA CCTCCTTTGC 
GTGACCCAGC AGAAGTAAGT CGGCGGCGTG ACACGCGAAT GGAGGAAACG 

22301 CATGCTTGAT TAGCACCGGT GGGTTGCTGA AACCCACCAT TTGTAGCGCC 
GTACGAACTA ATCGTGGCCA CCCAACGACT TTGGGTGGTA AACATCGCGG 

■22351 ACATCTTCTC TTTCTTCCTC GCTGTCCACG ATTACCTCTG GTGATGGCGG 
TGTAGAAGAG AAAGAAGGAG CGACAGGTGC TAATGGAGAC CACTACCGCC 

22401 GCGCTCGGGC TTGGGAGAAG GGCGCTTCTT TTTCTTCTTG GGCGCAATGG 
CGCGAGCCCG AACCCTCTTC CCGCGAAGAA AAAGAAGAAC CCGCGTTACC 

22451 CCAAATCCGC C6CCGAGGTC GATGGCCGCG GGCTGGGTGT GCGCGGCACC 
GGTTTAGGCG GCGGCTCCAG CTACCG6CGC CCGACCCACA C6C6CCGTGG 

22501 AGCGCGTCTT GTGATGAGTC TTCCTCGTCC TCGGACTCGA TACGCCGCCT 
TCGCGCA6AA CACTACTCAG AAGGAGCAGG AGCCTGAGCT ATGCGGCGGA 

22551 CATCCGCTTT TTTGGGGGCG CCCGGGGAGG CGGCGGCGAC GGGGACGGGG 
GTAGGCGAAA AAACCCCCGC GGGCCCCTCC GCCGCCGCTG CCCCTGCCCC 

22601 ACGACACGTC CTCCATGGTT GGGGGACGTC GCGCCGCACC GCGTCCGCGC 
TGCTGTGCAG GAGGTACCAA CCCCCTGCAG CGCGGCGTGG CGCAGGCGCG 

22651 TCGGGGGTGG TTTCGCGCTG CTCCTCTTCC CGACTGGCCA TTTCCTTCTC 
A6CCCCCACC AAAGCGCGAC GAGGAGAAGG GCTGACCGGT AAAGGAAGAG 
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22701 CTATAGGCAG AJ^plGATCA TGGAGTCAGT CGAGAAGAAG GACAGC^pv 
GATATCCGTC TTTTTCTAGT ACCTCAGTCA GCTCTTCTTC CTGTCGGATT 

22751 CCGCCCCCTC TGAGTTCGCC ACCACCGCCT CCACCGATGC CGCCAACGCG 
GGCGGGGGAG ACTCAAGCG6 TGGTGGCG6A GGTGGCTACG GCGGTTGCGC 

22801 CCTACCACCT TCCCCGTCGA GGCACCXICCG CTT6AGGAGG AGGAAGTGAT 
GGATGGTGGA AGGGGCAGCT CCGTGGGGGC GAACTCCTCC TCCTTCACTA 

22851 TATCGAGCAG GACCCAGGTT TTGTAAGCGA AGACGACGAG GACCGCTCAG 
ATAGCTCGTC CTX3GGTCCAA AACATTCGCT TCTGCTGCTC CTGGCGAGTC 

22901 TACCAACAGA 6GATAAAAAG CAAGACCAGG ACAACGCAGA GGCAAACGAG 
ATGGTTGTCT CCTATTTTTC GTTCTGGTCC TGTTGCGTCT CC6TTTGCTC 

22951 GAACAAGTCG GOCGGGGGGA CGAAAOGCAT GGCGACTACC TAGATGTGG6 
CTTGTTCAGC CCGCCCCCCT GCTTTCCGTA CCCCTGATCG ATCTACACCC 

23001 AGACGACGTG CTGTTGAAGC ATCTGCAGCG CCAGTGC6CC ATTATCTGCG 
TCTGCTGCAC GACAACTTCG TAGACGTCGC GGTCACGC6G TAATAGACGC 

23051 ACGCGTTGCA AGAGCGCAGC GATGTGCCCC TCGCCATAGC GGATGTCAGC 
TGCGCAACGT TCTCGCGTCG CTACACGGGG AGCGGTATCG CCTACAGTCG 

23101 CTTGCCTACG AACGCCACCT ATTCTCACCG CGCGTACCCC CCAAACGCCA 
GAACGGATGC TTGCGGTGGA TAAGAGTGGC GCGCATGGGG GGTTTGCGGT 

23151 AGAAAACGGC ACATGCGAGC CCAACCCGCG CCTCAACTTC TACCCCGTAT 
TCTTTTGCCG TGTACGCTC6 GGTTGGGCGC GGAGTTGAAG ATGGGGCATA 

23201 TTGCCGTGCC AGAGGTGCTT GCCACCTATC ACATCTTTTT CCAAAACTGC 
AACGGCACGG TCTCCACGAA CGGTGGATAG TGTAGAAAAA GGTTTTGACG 

23251 AAGATACCCC TATCCTGCCG TGCCAACCGC AGCCGAGCGG ACAAGCAGCT 
TTCTATGGGG ATAGGACGGC ACGGTTGGCG TCGGCTCGCC TGTTCGTCGA 

23301 GGCCTTGCGG CAGGGCGCTG TCATACCTGA TATCGCCTCG CTCAACGAAG 
CCGGAACGCC GTCCCGCGAC AGTATGGACT ATAGCGGAGC 6AGTTGCTTC 

23351 TGCCAAAAAT CTTTGAGGGT CTTGGACGCG ACGAGAAGCG CGCGGCAAAC 
ACGGTTTTTA GAAACTCCCA GAACCTGCGC TGCTCTTCGC GCGCCGTTTG 

23401 GCTCTGCAAC AGGAAAACA6 CGAAAAT6AA AGTCACTCTG GAGTGTTGGT 
CGAGACGTTG TCCTTTTGTC GCTTTTACTT TCAGTGAGAC CTCACAACCA 

23451 GGAACTCGAG GGTGACAACG CGCGCCTA6C CGTACTAAAA C6CAGCATCG 
CCTTGAGCTC CCACTGTTGC GCGCGGATCG GCATGATTTT GC6TCGTAGC 

23501 AGGTCACCCA CTTTGCCTAC CCGGCACTTA ACCTACCCCC CAAGGTCATG 
TCCAGTGGGT GAAACGGATG GGCCGTGAAT TGGATGGGGG GTTCCAGTAC 

23551 AGCACAGTCA TGAGTGAGCT GATCGTGCGC CGTGCGCAGC CCCTGGAGAG 
TCGTGTCAGT ACTCACTCGA CTAGCACGtG GCACGCGTCG GGGACCTCTC 

23601 GGATGCAAAT TTGCAAGAAC AAACAGAGGA GGGCCTACCC GCAGTTGGCG 
CCTACGTTTA AACGTTCTTG TTTGTCTCCT CCCGGATGGG CGTCAACCGC 
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23651 ACGAGCAGCT A||pCGCTGG CTTCAAACGC GCGAGCCTGC CGACTT^ps 
TGCTCGTCGA TCGCGCGACC GAAGTTTGCG CGCTCGGACG GCTGAACCTC 

23701 GAGCGAC6CA AACTAATGAT GGCCGCAGTG CTCGTTACCG T66AGCTTGA 
CTCGCT6CGT TTGATTACTA CCGGCGTCAC GAGCAAT66C ACCTCGAACT 

23*751 6TGCATGCAG CGGTTCTTTG CTGACCCGGA GATGCAGCGC AAGCTAGAGG 
CACGTACGTC 6CCAAGAAAC GACTGGGCCT CTACGTCGCG TTCGATCTCC 

23801 AAACATTGCA CTACACCTTT CGACA6GGCT ACGTACGCCA GGCCTGCAAG 
TTTGTAACGT GATGTGGAAA GCTGTCCCGA TGCATGCGGT CCGGACGTTC 

23851 ATCTCCAACG T6GAGCTCTG CAACCTGGTC TCCTACCTT6 GAATTTTGCA 
TAGAGGTT6C ACCTCGAGAC GTTGGACCA6 AGGATGGAAC CTTAAAACGT 

23901 C6AAAACCGC CTTG6GCAAA ACGTGCTTCA TTCCACGCTC AAG6GCGAGG 
GCTTTTGGCG GAACCCCTTT TCCACGAACT AACCTCCCAG TTCCCGCTCC 

23951 CGCGCCGCGA CTACGTCCGC GACTGCGTTT ACTTATTTCT ATGCTACACC 
GCGCGGCGCT GATGCAGGCG CTGACGCAAA TCAATAAAGA TACGATGTGG 

24001 TGGCAGACGG CCATGGGCGT TTGGCAGCAG TGCTTGGAGG AGTGCAACCT 
ACCGTCTGCC GGTACCCGCA AACCGTCGTC ACGAACCTCC TCACGTTGGA 

24051 CAAGGAGCTG CAGAAACTGC TAAAGCAAAA CTTGAAGGAC CTATGGACGG 
GTTCCTCGAC GTCTTTGACG ATTTCGTTTT GAACTTCCTG GATACCTGCC 

24101 CCTTCAACGA GCGCTCCGTG GCCGCGCACC TGGCGGACAT CATTTTCCCC 
GGAAGTTGCT CGCGAGGCAC CGGCGCGTGG ACCGCCTGTA GTAAAAGGGG 

24151 GAACGCCTGC TTAAAACCCT GCAACAGGGT CTGCCAGACT TCACCAGTCA 
CTTGCGGACG AATTTTGGGA CGTTGTCCCA GACGGTCTGA AGTGGTCAGT 

24201 AAGCATGTTG CAGAACTTTA GGAACTTTAT CCTAGAGCGC TCAGGAATCT 
TTCGTACAAC GTCTTGAAAT CCTTGAAATA GGATCTCGCG AGTCCTTAGA 

24251 TGCCCGCCAC CTGCTGT6CA CTTCCTAGCG ACTTTGTGCC CATTAAGTAC 
ACGGGCGGTG GACGACACGT GAAGGATCGC TGAAACACGG GTAATTCATG 

24301 CGCGAATGCC CTCCGCCGCT TTGGGGCCAC TGCTACCTTC TGCAGCTAGC 
GCGCTTACGG GAGGCGGCGA AACCCCGGTG ACGATGGAAG ACGTCGATCG 

24351 CAACTACCTT GCCTACCACT CTGACATAAT GGAAGACGTG AGCGGTGACG 
GTTGATGGAA CGGATGGTGA GACTGTATTA CCTTCTGCAC TCGCCACTGC 

24401 GTCTACTGGA GTGTCACTGT CGCTGCAACC TATGCACCCC GCACCGCTCC 
CAGAT6ACCT CACAGTGACA GCGACGTTGG ATACGTGGGG CGT6GCGAGG 

24451 CTGGTTTGCA ATTCGCAGCT GCTTAACGAA AGTCAAATTA TCGGTACCTT 
GACCAAACGT TAAGCGTCGA CGAATTGCTT TCAGTTTAAT AGCCATGGAA 

24501 TGAGCTGCAG GGTCCCTCGC CTGACGAAAA GTCCGCGGCT CCGGGGTTGA 
ACTCGACGTC CCA0GGAGC6 GACTGCTTTT CAGGCGCCGA GGCCCCAACT 

24551 AACTCACTCC GGGQCTGTGG ACGTCGGCTT ACCTTCGCAA. ATTTGTACCT 
TTGAGT6AGG CCCCGACACC TGCAGCCGAA TGGAAGCGTT TAAACATGGA 
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24601 GAGGACTACC A^SBTCCACGA GATTAGGTTC TACGAAGACC AATCCC(2iC'C 
CTCCTGATGG TGCGGGTGCT CTAATCCAAG ATGCTTCTGG TTAGG6CGGG 

24651 6CCTAATGCG GAGCTTACCG CCTGCGTCAT TACCCAGGGC CACATTCTTG 
CGGATTACGC CTCGAATGGC GGACGCAGTA ATGGGTCCCG GTGTAAGAAC 

24701 GCCAATTGCA AGCCATCAAC AAAGCCCGCC AAGAGTTTCT GCTACGAAAG 
CGGTTAACGT TCGGTAGTTG TTTCGGGCGG TTCTCAAAGA C6ATGCTTTC 

24751 6GACGGGGGG TTTACTTGGA CCCCCAGTCC GGC6AGGAGC TCAACCCAAT 
CCTGCCCCCC AAATGAACCT GGGGGTCA6G CCGCTCCTCG AGTTGGGTTA 

24801 CCCCCCGCCG CCGCAGCCCT ATCAGCAGCA GCCGCGGGCC CTTGCTTCCC 
GGGGGGCGGC GGCGTCGGGA TAGTCGTCGT CGGCGCCCGG GAACGAAGGG 

24851 AGGATGGCAC CCAAAAAGAA GCTGCAGCTG CCGCCGCCAC CCACG6ACGA 
TCCTACCGTG GGTTTTTCTT CGACGTCCAC GGCCGCGGTG G6TGCCTGCT 

24901 GGAGGAATAC TGGGACAGTC AGGCAGAG6A GGTTTTGGAC GAGGAGGAGG 
CCTCCTTATG ACCCT6TCAG TCCGTCTCCT CCAAAACCTG CTCCTCCTCC 

24951 AGGACATGAT 66AAGACTGG GAGAGCCTAG ACGAGGAAGC TTCCGA6GTC 
TCCTGTACTA CCTTCTGACC CTCTCGGATC TGCTCCTTCG AAGGCTCCAG 

25001 GAAGAGGTGT CAGACGAAAC ACCGTCACCC TCGGTCGCAT TCCCCTCGCC 
CTTCTCCACA GTCTGCTTT6 TGGCAGTG6G A6CCAGCGTA AGGGGAGCGG 

25051 GGCGCCCCAG AAATCGGCAA CCGGTTCCAG CATGGCTACA ACCTCCGCTC 
CCGCGGGGTC TTTAGCCGTT GGCCAAGGTC GTACCGATGT TGGAGGCGAG 

25101 CTCAGGCGCC GCCGGCACTG CCCGTTCGCC GACCCAACCG TAGATGGGAC 
GAGTCCGCGG CGGCCGTGAC GGGCAAGCGG CTGGGTTGGC ATCTACCCT6 

25151 ACCACTGGAA CCAGGGCCGG TAAGTCCAAG CAGCCGCCGC CGTTAGCCCA 
TGGTGACCTT GGTCCCGGCC ATTCAGGTTC GTCGGCGGCG GCAATCGGGT 

25201 AGAGCAACAA CAGCGCCAAG GCTACCGCTC ATGGCGCGGG CACAAGAACG 
TCTCGTTGTT GTCGCGGTTC CGATGGCGAG TACCGCGCCC GTGTTCTTGC 

25251 CCATAGTTGC TTGCTTGCAA GACTGTGGGG GCAACATCTC CTTCGCCCGC 
GGTATCAACG AACGAACGTT CTGACACCCC CGTTGTAGAG GAAGCGGGCG 

25301 CGCTTTCTTC TCTACCATCA CGGCGTGGCC TTCCCCCGTA ACATCCTGCA 
6CGAAAGAAG AGATGGTAGT GCC6CACC6G AAGGGGGCAT TGTAGGACGT 

25351 TTACTACCGT CATCTCTACA GCCCATACTG CACCGGC6GC AGCGGCAGCA 
AATGATGGCA GTAGA6ATGT CGGGTATGAC GT6GCCGCCG TCGCCGTCGT 

25401 ACAGCAGCGG CCACACAGAA GCAAAGGCGA CCGGATAGCA AGACTCTGAC 
TGTCGTCGCC GGTGTGTCTT CGTTTCCGCT GGCCTATCGT TCTGA6ACTG 

25451 AAAGCCCAAG AAATCCACAG CGGCGGCAGC AGCAGGAGGA GGAGCGCTGC 
TTTCGGGTTC TTTAGGTGTC GCCGCCGTCG TCGTCCTCCT CCTCGCGACG 

25501 GTCTGGCGCC CAACGAACCC GTATCGACCC GCGAGCTTAG AAACAGGATT 
CACACCCCGG GTTGCTTGGG CATAGCTGGG CGCTCGAATC TTTGTCCTAA 
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25551 TTTCCCACTC TSWrGCTAT ATTTCAACAG AGCAGGGGCC AAGAACJ^t^ 
AAAGGGTGAG ACATACGATA TAAAGTTGTC TCGTCCCCGG TTCTTGTTCT 

25601 GCTGAAAATA AAAAACAGGT CTCTGCGATC CCTCACCCGC AGCTGCCTGT 
CGACTTTTAT TTTTTGTCCA GAGACGCTAG GGAGTGGGCG TCGACGGACA 

25651 ATCACAAAAG CGAAGATCAG CTTCGGCGCA CGCTGGAAGA CGCGGAGGCT 
TAGTGTTTTC GCTTCTAGTC GAAOCCGCGT GCGACCTTCT GCGCCTCCGA 

25^01 CTCTTCAGTA AATACTGCGC GCTGACTCTT AAGGACTAGT TTCGCGCCCT 
GAGAAGTCAT TTATGACGCG CTGACTGAGAA TTCCTGATCA AAGCGCGGGA 

25751 TTCTO^TT TAAGC6CGAA AACTACGTCA TCTCCAGCGG CCACACCCGG 
AAGA6TTTAA ATTC6CGCTT TTGATGCAGT AGAGGTCGCC GGTGTGGGCC 

25801 CGCCAGCACC TGTTGTCAGC 6CCATTATGA GCAAGGAAAT TCCCACGCCC 
GCGGTCGTGG ACAACAGTCG CGGTAATACT CGTTCCTTTA A6GGTGCG6G 

25851 TACATGTGGA 6TTACCAGCC ACAAATGGGA CTTGCGGCTG GAGCTGCCCA 
ATGTACACCT CAATGGTCGG TGTTTACCCT GAACGCCGAC CTCGACGGGT 

25901 AGACTACTCA ACCCGAATAA ACTACATGAG CGCGGGACCC CACATGATAT 
TCTGATGAGT TGGGCTTATT TGATGTACTC GCGCCCTGGG 6TGTACTATA 

25951 CCCGGGTCAA CGGAATACGC GCCCACCGAA ACCGAATTCT CCTGGAACAG 
GGGCCCAGTT GCCTTATGCG CGGGTGGCTT TGGCTTAAGA GGACCTTGTC 

26001 GCGGCTATTA CCACCACACC TCGTAATAAC CTTAATCCCC 6TAGTTGGCC 
CGCCGATAAT GGTGGTGTGG AGCATTATTG GAATTAGGGG CATCAACCGG 

26051 CGCTGCCCTG GTGTACCAGG AAAGTCCCGC TCCCACCACT GTGGTACTTC 
GC6ACGGGAC CACArGGTCC TTTCAGGGCG AGGGTGGTGA CACCATGAAG 

26101 CCAGAGACGC CCAGGCCGAA GTTCAGATGA CTAACTCAGG GGCGCAGCTT 
GGTCTCTGCG GGTCCGGCTT CAAGTCTACT GATTGAGTCC CCGCGTCGAA 

26151 GCGGGCGGCT TTCGTCACAG GGTGCGGTCG CCCGG6CAGG GTATAACTCA 
CGCCCGCCGA AAGCAGTGTC CCACGCCAGC 6GGCCCGTCC CATATTGAGT 

26201 CCT6ACAATC AGAC6GCGA6 GTATTCAGCT CAACGACGAG TC6GTGAGCT 
GGACT6TTAG TCTCCCGCTC CATAAGTCGA GTTGCTGCTC AGCCACTCGA 

26251 CCTCGCTTGG TCTCCGTCCG GACGGGACAT TTCAGATCGG CGGCGCCGGC 
GGAGCGAACC AGAQGCAGGC CTGCCCTGTA AAGTCTAGCC GCCGCGGCCG 

26301 CGCTCTTCAT TCACGCCTCG TCAGGCAATC CTAACTCTGC AGACCTCGTC 
GCGAGAAGTA AGTGCGGAGC AGTCCGTTA6 GATTGAGACG TCT6GAGCA6 

26351 CTCTGAGCCG CGCTCTGGAG GCATTGGAAC TCTGCAATTT ATTGA6GAGT 
GAGACTCGGC GCGAGACCTC CGTAACCTTG AGACGTTAAA TAACTCCTCA 

26401 TTGTGCCATC GGTCTACTTT AACCCCTTCT CGGGACCTCC CGGCCACTAT 
AACACGGTAG CCA6ATGAAA TTOGGGAAGA GCCCTGGAG6 GCCG6TGATA 

26451 CCGGATCAAT TTATTCCTAA CTTTGACGCG GTAAAGGACT CGGCGGACGG 
GGCCTAGTTA AATAAGGATT GAAACTGCGC CATTTCCTGA GCCGCCTGCC 
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26501 CTACGACTGX AIPtAAGTG GAGAGGCAGA GCAACT6CGC CTGAAA^C 
GATGCTGACT TACAATTCAC CTCTCCGTCT C6TTGACGCG 6ACTTTGTGG 

26551 TGGTCCACTG TCGCCGCCAC AAGTGCTTTG CCCGCGACTC CGGTGAGTTT 
ACCAGGTGAC AGCGGCGGTG TTCACGAAAC GGGCGCTGAG GCCACTCAAA 

26601 TGCTACTTTG AATTGCCCGA GGATCATATC GAGGGCCCGG CGCACGGCGT 
ACGATGAAAC TTAACCGCCT CCTAGTATAG CTCCCGGGCC GCGTGCCGCA 

26651 CCGGCTTACC GCCCAGGGAG AGCTTGCCCG TAGCCTGATT CGGGAGTTTA 
GGCC6AAT66 C6GGTCCCTC TCGAACG66C ATCGGACTAA GCCCTCAAAT 

26701 CCCAGC6CCC CCTGCTAGTT GA6CGGGACA GGGGACCCTG TGTTCTCACT 
GGGTCGCGGG GGACGATCAA CTCGCCCTGT CCCCTGGGAC ACAAGAGTGA 

26751 GTGATTTCCA ACTGTCCTAA CCCTGGATTA CATCAAGATC TTTGITGCCA 
CACTAAACGT TGACAGGATT 6GGACCTAAT GTAGTTCTAG AAACAACGGT 

26B01 TCTCTGTGCT GAGTATAATA AATACAGAAA TTAAAATATA CTGGGGCTCC 
AGAGACACGA CTCATATTAT TTATGTCTTT AATTTTATAT GACCCCGAGG 

26B51 TATCGCCATC CTGTAAACGC CACCGTCTTC ACCCGCCCAA GCAAACCAAG 
ATAGCGGTAG GACATTTGCG GTGGCA6AAG TGGGCGGGTT CGTTTGGTTC 

26901 GCGAACCTTA CCTGGTACTT TTAACATCTC TCCCTCTGTG ATTTACAACA 
CGCTTGGAAT GGACCATGAA AATTGTAGAG AGGGAGACAC TAAATGTTGT 

26951 GTTTCAACCC AGACGGAGTG AGTCTACGAG AGAACCTCTC CGAGCTCAGC 
CAAAGTTGGG TCTGCCTCAC TCAGATGCTC TCTTGGAGAG GCTCGAGTCG 

27001 TACTCCATCA GAAAAAACAC CACCCTCCTT ACCTGCCGGG AACGTACGAG 
ATGAGGTAGT CTTTTTTGTG GTGGGAGGAA TCGACGGCCC TTGCATGCTC 

27051 TGCGTCACCG GCCGCTGCAC CACACCTACC GCCTGACCGT AAACCAGACT 
ACGCAGTGGC CGGCGACGTG GTGTG6ATGG CGGACTGGCA TTTG6TCTGA 

27101 TTTTCCGGAC AGACCTCAAT AACTCTGTTT ACCAGAACAG GAGGTGAGCT 
AAAAGGCCTG TCTGGAGTTA TTGAGACAAA TGGTCTTGTC CTCCACTCGA 

27151 TAGAAAACCC TTAGGGTATT AGGCCAAAGG CCCAGCTACT GTGGCGTTTA 
ATCTTTTGGG AATCCCATAA TCCGGTTTCC GCGTCGATGA CACCCCAAAT 

27201 TGAACAATTC AAGCAACTCT ACGGGCTATT CTAATTCAGG TTTCTCTAGA 
ACTTGTTAAG TTCGTTGAGA TGCCCGATAA GATTAAGTCC AAAGAGATCT 

27251 ATCGGGGTTG GGGTTATTCT CTGTCTTGTG ATTCTCTTTA TTCTTATACT 
TA6CCCCAAC CCCAATAAGA GACAGAACAC TAAGAGAAAT AAGAATATGA 

27301 AACGCTTCTC TGCCTAAGGC TCGCCGCCTG CTGTGTGCAC ATTTGCATTT 
7T0CGAAGAG ACGGATTCCG AGCG6CGGAC GACACACGTG TAAACGTAAA 

27351 ATTGTCAGCT TTTTAAACGC TGGGGTCGCC ACCCAAGATG ATTAGGTACA 
TAACAGTCGA AAAATTTGCG ACCCCAOCGG TGGGTTCTAC TAATCCAT6T 

27401 TAATCCTAGG TTTACTCACC CTTGCGTCAG CCCACGGTAC CACCCAAAAG 
ATTAGGATCC AAATGA6TGG GAACGCAGTC GGGT6CCATG GTGGGTTTTC 
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27451 GTGGATTTTA- A^PsCCAGC CTGTAATGTT ACATTCGCAG CTGAAGfl^KA 
CACCTAAAAT TCXTTCGGTCG GACATTACAA TGTAAGCGTC GACTTCGATT 

27501 TGA6TGCACC ACTCTTATAA AATGCACCAC AGAACATGAA AAGCTGCTTA 
ACTCACGTGG T6AGAATATT TTACGTGGTG TCTTGTACTT TTCGACGAAT 

27551 TTCGCCACAA AAACAAAATT GGCAAGTATG CTGTTTATGC TATTTGGCAG 
AAGCGGTGTT TTTGTTTTAA CCGTTCATAC GACAAATACG ATAAACCGTC 

27601 CCAGGTGACA CTACAGAGTA TAATGTTACA GTTTTCCAGG GTAAAAGTCA 
GGTCCACTGT GATGTCTCAT ATTACAATGT CAAAAGGTCC CATTTTCAGT 

27651 TAAAACTTTT ATGTATACTT TTCCATTTTA TGAAATGT6C GACATTACCA 
ATTTTGAAAA TACATATGAA AAGGTAAAAT ACTTTACACG CTGTAATGGT 

27701 TGTACATGAC CAAACAGTAT AAGTT6TGGC CCCCACAAAA TTGTGTGGAA 
ACATGTACTC GTTTGTCATA TTCAACACCG 6GGGTGTTTT AACACACCTT 

27751 AACACTGGCA CTTTCTGCTG CACTGCTATG CTAATTACAG TGCTCGCTTT 
TTGTCACCGT GAAAGACGAC GTGACGATAC 6ATTAATGTC ACGAGCGAAA 

27801 GGTCTGTACC CTACTCTATA TTAAATACAA AAGCAGACGC AGCTTTATTG 
CCAGACATGG GATGAGATAT AATTTATGTT TTCGTCTGCG TCGAAATAAC 

27851 AGGAAAAGAA AATGCCTTAA TTTACTAAGT TACAAAGCTA ATGTCACCAC 
TCCTTTTCTT TTACGGAATT AAATGATTCA ATGTTTCGAT TACAGTGGTG 

27901 TAACTGCTTT ACTCGCTCCT TGCAAAACAA ATTCAAAAA6 TTAGCATTAT 
ATTGACGAAA TGAGCGACGA ACGTTTTGTT TAAGTTTTTC AATCGTAATA 

27951 AATTAGAATA GGATTTAAAC CCCCCQGTCA TTTCCTGCTC AATACCATTC 
TTAATCTTAT CCTAAATTTG GGGG6CCAGT AAAGGACGAG TTATCGTAAG 

28001 CCCTGAACAA TTGACTCTAT GTGGGATATG CTCCAGCGCT ACAACCTTGA 
GGGACTTGTT AACTGAGATA CACCCTATAC GAGGTCGCGA TGTTGGAACT 

28051 AGTCAGGCTT CCTGGATGTC AGCATCTGAC TTTGGCCAGC ACCTGTCCCG 
TCAGTCCGAA 6GACCTACAG TCGTAGACTG AAACCGGTC6 TG6ACAGGGC 

28101 C6GATTTGTT CCAGTCCAAC TACAGCGACC CACCCTAACA GAGATGACCA 
GCCTAAACAA GGTCAGGTTG ATGTCGCTGG GTGGGATTGT CTCTACTGGT 

28151 ACACAACCAA CGCGGCCGCC GCTACC6GAC TTACATCTAC CACAAATACA 
TGTGTTGGTT GCGCCGGCGG CGATGGCCTG AATGTAGATG GTGTTTATGT 

28201 CCCCAAGTTT CTGCCnTGT CAATAACTGG GATAACTTGG GCATGTGQTG 
GGGGTTCAAA GACGGAAACA GTTATTGACC CTATTGAACC CGTACACCAC 

28251 GTTCTCCATA GCGCTTATGT TTGTATGCCT TATTATTATG TGGCTCATCT 
CAAGAGGTAT CGCGAATACA AACATACGGA ATAATAATAC ACCGAGTAGA 

283 01 GCTGCCTAAA GCGCAAACGC GCCCGACCAC CCATCTATAG TCCCATCATT 
CGAC6GATTT CGC6TTTGCG CGGGCTGGTG 6GTAGATATC AGGGTAGTAA 

28351 GTGCTACACC CAAACAATGA TGGAATCCAT AGATTGGACG GACTGAAACA 
CACGATGTGG QTTT6TTACT ACCTTAGGTA TCTAACCTGC CTGACTTTGT 
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28401 CATGTTCTTT 't^^ACAG TATGATTAAA TGAGACATGA lTCCTC«f 
GTACAAGAAA AgIgAATGTC ATACTAATTT ACTCTGTACT AAGGAGctCa 

2B451 TTTTATATTA CTGACCCTTG TTGCGCTTTT TTGTGCGTGC TCCACATTGG 
AAAATATAAT GACTG6GAAC AACGCGAAAA AACACGCACG AGGTGTAACC 

28501 CTGCGGTTTC TCACATCGAA GTAGACTGCA TTCCAGCCTT CACAGTCTAT 
GACGCCAAAG AGTGTAGCTT CATCTGACGT AAGGTCGGAA GTGTCAGATA 

28551 TTGCTTTACG GATTTGTCAC CCTCACGCTC ATCTGCAGCC TCATCACTGT 
AACGAAATGC CTAAACAGTG GGAGTGCGAG TAGACGTCGG AGTAGTGACA 

28601 GGTCATCGCC TTTATCCAGT GCATTGACTG GGTCTGTGTG CGCTTTGCAT 
CCAGTAGCGG AAATAGGTCA CGTAACTGAC CCAGACACAC GCGAAACGTA 

28651 ATCTCAGACA CCATCCCCA6 TACAGGGACA GGACTATAGC TGAGCTTCTT 
TAGAGTCTGT GGTAGGGGTC ATGTCCCT6T CCTGATATCG ACTCGAAGAA 

28701 AGAATTCTTT AATTATGAAA TTTACTGTGA CTTTTCTGCT GATTATTTGC 
TCTTAAGAAA TTAATACTTT AAATGACACT GAAAAGACGA CTAATAAACG 

2B751 ACCCTATCTG CGTTTTGTTC CCCGACCTCC AAGCCTCAAA 6ACATATATC 
TGGGATAGAC GCAAAACAAG GGGCTGGAGG TTCGGAGTTT CTGTATATAG 

28801 ATGCAGATTC ACTCGTATAT GGAATATTCC AAGTTGCTAC AATGAAAAAA 
TACGTCTAAG TGAGCATATA CCTTATAAGG TTCAACGATG TTACTTTTTT 

28851 GCGATCTTTC CGAAGCCTGG TTATATGCAA TCATCTCTGT TATGGTGTTC 
CGCTAGAAAG GCTTCGGACC AATATACGTT AGTAGAGACA ATACCACAAG 

28901 TGCAGTACCA TCTTAGCCCT AGCTATATAT CCCTACCTTG ACATTGGCTG 
ACGTCATGGT AGAATCGGGA TCGATATATA GGGATGGAAC TGTAACCGAC 

28951 GAACGCAATA GATGCCATGA ACCACCCAAC TTTCCCCGCG CCCGCTATGC 
CTTGCGTTAT CTACGGTACT TGGTGGGTTG AAAGGGGCGC GGGCGATACG 

29001 TTCCACTGCA ACAAGTTGTT GCCGGCGGCT TTGTCCCAGC CAATCAGCCT 
AAGGTGACGT TGTTCAACAA CGGCCGCCGA AACAGGGTCG GTTAGTCGGA 

29051 CGCCCACCTT CTCCCACCCC CACTGAAATC AGCTACTTTA ATCTAACAGG 
GCGGGTGGAA GAGGGTGGGG CTCACTTTAG TCGATGAAAT TAGATTGTCC 

29101 AGGAGATGAC TGACACCCTA GATCTAGAAA TGGACGGAAT TATTACAGAG 
TCCTCTACTG ACTQTGGGAT CTAGATCTTT ACCTGCCTTA ATAATGTCTC 

29151 CAGCGCCTGC TAGAAAGACG CAGGGCAGC6 GCCGAGCAAC AGCGCATGAA 
GTC6CGGACG ATCTTTCTGC GTCCCGTCGC CG6CTC6TT6 TCGCGTACTT 

29201 TCAAGAGCTC CAAGACATGG TTAACTTGCA CCAGTGCAAA AGGGGTATCT 
AGTTCTCGAG GTTCTGTACC AATTGAACGT GGTCACGTTT TCCCCATAGA 

29251 TTTGTCTCGT AAAGCAGGCC AAAGTCACCT ACGACAGTAA TACCACCGGA 
AAACA6AGCA TTTCGTCCGG TTTCAGTG6A T6CTGTCATT ATGGTGGCCT 

29301 CACCGCCTTA GCTACAAGTT GCCAACCAAG CGTCAGAAAT T6GTGGTCAT 
GTGGCCGAAT C6ATGTTCAA CGGTTGGTTC GCAGTCTTTA ACCACCAGTA 
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29351 GGTGGGAGAA J^j^ptXATTA CCATAACTCA GCACTCGGTA GAAACCf^pb 
CCACCCTCTT TTCGGGTAAT GGTATTGAGT CGTGAGCCAT CTTTGGCTTC 

29401 GCTGCATTCA CTCACCTTGT CAA6GACCTG AGGATCTCTG CACCCTTATT 
CGACGTAAGT GAGTGGAACA GTTCCTGGAC TCCTAGAGAC GTGGGAATAA 

29451 AAGACCCTGT GCGGTCTCAA AGATCTTATT CCCTTTAACT AATAAAAAAA 
TTCTGGGACA CGCCAGAOTT TCTAGAATAA GG6AAATTGA TTATTTTTTT 

29501 AATAATAAAG CATCACTTAC TTAAAATCAG TTAGCAAATT TCTGTCCAGT 
TTATTATTTC GTAGTGAATG AATTTTAGTC AATCGTTTAA AGACAGGTCA 

29551 TTATTCAGCA GCACCTCCTT GCCCTCCTCC CAGCTCTGGT ATTGCAGCTT 
AATAAGTCGT CGTGGAGGAA CG6GAGGAGG GTCGAGACCA TAACGTCGAA 

29601 CCTCCTGGCT GCAAACTTTC TCCACAATCT AAATGGAATG TCAGTTTCCT 
GGAGGACCGA CGTTTGAAA6 AGGT(n*TAGA TTTACCTTAC AGTCAAAGGA 

29651 CCTGTTCCTG TCCATCCGCA CCCACTATCT TCATGTTGTT 6CA6ATGAAG 
GGACAAGGAC AGGTAG6CGT GGGTGATAGA AGTACAACAA CGTCTACTTC 

29701 CGC6CAAGAC CGTCTGAAGA TACCTTCAAC CCCGTGTATC CATATGACAC 
GCGCGTTCTG GCAGACTTCT ATGGAAGTTG CCGCACATAG GTATACTGTG 

29751 GGAAACCGGT CCTCCAACTG TGCCTTTTCT TACTCCTCCC TTTGTATCCC 
CCTTTGGCCA GGA6GTTGAC ACGGAAAAGA ATGAGGAGGG AAACATAGGG 

29B01 CCAATGGGTT TCAAGAGAGT CCCCCTGGGG TACTCTCTTT GCGCCTATCC 
6GTTACCCAA AGTTCTCTCA GGGGGACCCC ATGAGAGAAA CGCGGATAGG 

29851 GAACCTCTAG TTACCTCCAA TGGCATGCTT GCGCTCAAAA TGGGCAACGG 
CTTGGAGATC AATGGAGGTT ACCGTAC6AA CGCGAGTTTT ACCCGTTGCC 

29901 CCTCTCTCTG GACGAGGCCG 6CAACCTTAC CTCCCAAAAT GTAACCACTG 
GGAGAGAGAC CTGCTCCGGC CGTTGGAATG .GAGGGTTTTA CATTGGTGAC 

29951 TGAGCCCACC TCTCAAAAAA ACCAAGTCAA ACATAAACCT GGAAATATCT 
ACTCGGGTGG AGA6TTTTTT TGGTTCAGTT TGTATTTGGA CCTTTATAGA 

30001 GCACCCCTCA CAGTTACCTC AGAAGCCCTA ACTGTGGCTG CCGCCGCACC 
CGTGGGGAGT GTCAATGGAG TCTTCGGGAT TGACACCGAC GGCGGCGTGG 

30051 TCTAATGGTC GCGGGCAACA CACTCACCAT GCAATCACAG GCCCCGCTAA 
AGATTACCAG CGCCCGTTGT GTGAGTGGTA CGTTAGTGTC CGCGGCGATT 

30101 CCGTGCACGA CTCCAAACTT AGCATTGCCA CCCAAGGACC CCTCACAGTG 
GGCACGTGCT GAGGTTTGAA TC6TAACGGT GGGTTCCTG6 G6AGTGTCAC 

30151 TCAGAAGGAA AGCTAGCCCT GCAAACATCA GGCCCCCTCA CCACCACCGA 
AGTCTTCCTT TCGATCGGQA CGTTTGTAGT CCGGGGGAGT GGTGGTGGCT 

30201 TA6CAGTACC CTTACTATCA CTGCCTCACC CCCTCTAACT ACTGCCACTG 
ATCGTCATG6 GAATGATAG7 GACGGAGTGG GGGAGATT6A TCAC6GTGAC 

30251 GTAGCTTGGG CATTGACTTG AAAGAGCCCA TTTATACACA AAATGGAAAA 
CATCGAACCC GTAACTGAAC TTTCTCGGGT AAATATGTGT TTTACCTTTT 
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3 03 OX CTAGGACTAA Ai^CGGGGC TCCTTTGCAT GTAACAGAtC'^^CtTAiCMk: 
GATCCTGATT tcBgCCCCG AGGAAACGTA CATTGTCTGC TGGATT^ffc 

30351 TTTGACCGTA GCAACTGGTC CAGGTGTGAC TATTAATAAT ACTTCCTTGC 
AAACTGGCAT CGTTGACCAG GTCCACACTG ATAATTATTA TGAAGGAACG 

30401 AAACTAAAGT TACTG6AGCC TTGGGTTTTG ATTCACAA6G CAATATGCAA 
TTTGATTTCA ATGACCTCGG AACCCAAAAC TAAGTGTTCC GTTATACGTT 

30451 CTTAATGTAG CAGGAGGACT AAGGATTGAT TCTCAAAACA GACGCCTTAT 
GAATTACATC GTCCTCCTGA TTCCTAACTA AGAGTTTTGT CTGC6GAATA 

30501 ACTTGATGTT AGTTATCCGT TTGATQCTCA AAACCAACTA AATCTAAGAC 
• TGAACTACAA TC71ATAG6CA AACTACGA6T TTTGGTTGAT TTAGATTCTG 

30551 TAGGACAGGG CCCTCTTTTT ATAAACTCAG CCCACAACTT GGATATTAAC 
ATCCTGTCCC GGGAGAAAAA TATTTGAGTC GGGTGTTGAA CCTATAATTG 

30601 TACAACAAAG GCCTTTACTT 6TTTACAGCT TCAAACAATT CCAAAAAGCT 
ATGTTGTTTC CGGAAATGAA CAAATGTCGA A6TTTGTTAA 6GTTTTTCGA 

30651 TGA6GTTAAC CTAAGCACTC CCAAGGGGTT GATGTTTGAC GCTACAGCCA 
ACTCCAATTG GATTCGTGAC 6GTTCCCCAA CTACAAACTG CGATGTCGGT 

30701 TAGCCATTAA TGCAGGAGAT GGGCTTGAAT TTGGTTCACC TAATGCACCA 
ATCGGTAATT ACGTCCTCTA CCCGAACTTA AACCAAGTGG ATTACGTGGT 

30751 AACACAAATC CCCTCAAAAC AAAAATT6GC CATGGCCTAG AATTTGATTC 
TTGTGTTTAG GGGAGTTTT6 TTTTTAACCG GTACCGGATC TTAAACTAAG 

30801 AAACAAGGCT ATGGTTCCTA AACTAGGAAC TGGCCTTAGT TTTGACAGCA 
TTTGTTCCGA TACCAAGGAT TTGATCCTTG ACCGGAATCA AAACTGTCGT 

30851 CAGGTGCCAT TACAGTA6GA AACAAAAATA ATGATAAGCT AACTTTGTGG 
GTCCACGGTA ATGTCATCCT TTGTTTTTAT TACTATTCGA TTGAAACACC 

30901 ACCACACCAG CTCCATCTCC TAACTGTA6A CTAAATGCAG AGAAA6AT6C 
TGGTGTGGTC GAGGTAGAGG ATTGACATCT GATTTACGTC TCTTTCTACG 

30951 TAAACTCACT TTGGTCTTAA CAAAATGTGG CAGTCAAATA CTTGCTACAG 
ATTTGAGTGA AACCAGAATT CTTTTACACC GTCAGTTTAT GAACGATGTC 

31001 TTTCAGTTTT GGCTGTTAAA GGCAGTTTGG CTCCAATATC TGGAACAGTT 
AAAGTCAAAA CCGACAATTT CCGTCAAACC GAGGTTATAG ACCTTGTCAA 

31051 CAAAGTGCTC ATCTTATTAT AAGATTTGAC GAAAAT66AG TGCTACTAAA 
GTTTCACGAG TAGAATAATA TTCTAAACTG CTTTTACCTC ACGATGATTT 

31101 CAATTCCTTC CTG6ACCCAG AATATTCCAA CTTTAGAAAT GGAGATCTTA 
GTTAA6GAAG GACCTGG6TC TTATAACCTT GAAATCTTTA CCTCTAGAAT 

31151 CTGAA6GCAC AGCCTATACA AACGCTGTTG GATTTATGCC TAACCTATCA 
GACTTCCGTG TCGGATATGT TTGCGACAAC CTAAATACGG ATTGGATAGT 

31201 GCTTATCCAA* AATCTCACGG TAAAACTGCC AAAAGTAACA TTGTCAGTCA 
CGAATAGGTT TTA6AGTGCC ATTTTGACGG TTTTCATTGT AACAGTCAGT 
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31251 agtttactta ajucgagaca aaactaaacc tgtaacacta accatthpsc 
tcaaatgaat ttiscctctgt tttgarttgg acattgtgat t6otaatgtg 

31301 taaacggtac acaggaaaca ggagacacaa ctccaagtgc atactctatg 
atttgccatg tgtcctttgt cctctgtgtt gaggttcacg tatgagatac 

31351 tcattttcat gggactggtc tggccacaac tacattaatg aaatatttgc 
agtaaaagta ccctgaccag accggtgttg atgtaattac tttataaacg 

31401 cacatcctct tacacttttt catacattgc ccaagaataa agaatcgttt 
gtgtaggaga atgt6aaaaa gtatgtaacg ggttcttatt tctta6caaa 

31451 gtgttatgtt tcaacgtgtt tatttttcaa ttgcagaaaa tttcaagtca 
cacaatacaa agttgcacaa ataaaaagtt aacgtctttt aaagttcagt 

31501 tttttcattc agtagtatag ccccaccacc acatagctta tacagatcac 
aaaaagtaag tcatcataitc ggggtggtgg tgtatcgaat atgtctagtg 

31551 cgtaccttaa tcaaactcac agaaccctag tattcaacct gccacctccc 

GCATGGAATT AGTTTGAGTG TCTTGGGATC ATAAGTTGGA CGGTGGAGGG 

31601 TCCCAACACA CAGAGTACAC AGTCCTTTCT CCCCGGCTGG CCTTAAAAAG 
AGGGTTGTGT GTCTCATGTG TCAOGAAAGA GGGGCCGACC GGAATTTTTC 

31651 CATCATATCA TGGGTAACAG ACATATTCTT AGGTGTTATA TTCCACACGG 
GTAGTATAGT ACCCATTGTC T6TATAA6AA TCCACAATAT AAGGTGTGCC 

31701 TTTCCTGTCG AGCCAAACGC TCATCAGTGA TATTAATAAA CTCCCCGGGC 
AAAGGACAGC TCGGTTTGCG AGTAGTCACT ATAATTATTT GAGGGGCCCG 

31751 AGCTCACTTA AGTTCATGTC GCTGTCCAGC TGCTGAGCCA CAGGCTGCTG 
TCCAGTGAAT TCAAGTACAG CGACAGGTCG ACGACTCGGT GTCCGACGAC 

31801 TCCAACTTGC GGTTGCTTAA CGGGCGGCGA AGGAGAAGTC CACGCCTACA 
AGGTTGAACG CCAACGAATT GCCCGCCGCT TCCTCTTCAG GTGCGGATGT 

31851 TGGGGGTAGA GTCATAATCG TGCATCAGGA TAGGGCGGTG GTGCTGCAGC 
ACCCCCATCT CAGTATTAGC ACGTAGTCCT ATCCCGCCAC CACGACGTCG 

31901 AGCGCGCGAA TAAACTGCTG CCCCCGCCGC TCCGTCCTGC AGGAATACAA 
TCGCGCGCTT ATTTGACGAC GGCGGCGGCG AGGCAGGACG TCCTTATGTT 

31951 CATGGCAGTG GTCTCCTCAG CGATGATTCG CACCGCCCGC AGCATAAGGC 
GTACCGTCAC CAGAGGAGTC GCTACTAAGC GT6GCGGGCG TCGTATTCCG 

32001 GCCTTGTCCT CCGGGCACAG CAGCCCACCC TGATCTCACT TAAATCAGCA 
CGGAACAGGA GGCCCGTGTC GTCGCGTGGG ACTAGAGTGA ATTTAGTCGT 

32051 CAGTAACTGC AGCACAGCAC CACAATATTG TTCAAAATCC CACAGTGCAA 
GTCATTGACG TCGTGTCGTG GTGTTATAAC AAGTTTTAGG GTGTCACGTT 

32101 G6CGCTGTAT CCAAAGCTCA TGGCGGGGAC CACAGAACCC ACGTGGCCAT 
CCGCGACATA G6TTTCGAGT ACCGCCCCTG GTGTCTTGGG TGCACCGGTA 

32151 CATACCACAA GCGCAGGTAG ATTAAGTGGC 6ACCCCTCAT AAACACGCTG 
GTATGGTGTT CGCGTCCATC TAATTCACC6 CTGGGGAGTA TTTGTGCGAC 
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32201 GACATAAACA T^CTCTTT TGGCATGTTG TAATTCACCA CCTCCC^A 
CTGTATTTGT AATGGAGAAA ACCGTACAAC ATTAAGTGGT GGAGGGCCAT 

32251 CCATATAAAC CTCTGATTAA ACATGGCGCC ATCCACCACC ATCCTAAACC 
GGTATATTTG GAGACTAATT TGTACCGCGG TAGGTGGTGG TAGGATTTGG 

32301 AGCTGGCCAA AACCTGCCCG CCGGCTATAC ACTGCAGGGA ACCOGGACTG 
TCGACCGGTT TTGGACGGGC GGCCGATATG TGACGTCCCT TGGCCCTGAC 

32351 GAACAATGAC AGTGGAGAGC CCAGGACTCG TAACCATGGA TCATCAT6CT 
CTT6TTACTG TCACCTCTCG GGTCXTGAGC ATT6GTACCT AGTAGTACGA 

32401 CGTCATQATA TCAATGTT6Q CACAACACA6 6CACACGTGC ATACACTTCC 
GCAGTACTAT AGTTACAACC GTGTTGTGTC CGTGTGCACG TATGTGAAGG 

32451 TCAGGATTAC AAGCTCCTCC CGCGTTASAA CCATATCCCA GGGAACAACC 
AGTCCTAATG TTCGAGGAGG GCGCAATCTT . GGTATAGGGT CCCTTGTTGG 

32501 CATTCCTGAA TCAGCGTAAA TCCCACACTG CAGGGAAGAC CTCGCACGTA 
GTAAGGACTT AGTCGCATTT AGGGTGTGAC GTCCCTTCTG GAGCGTGCAT 

32551 ACTCACGTTG TGCATTGTCA AAGTGTTACA TTCGGGCAGC AGCGGATGAT 
TGAGTGCAAC ACGTAACAGT TTCACAATGT AAGCCC6TCG TCGCCTACTA 

32601 CCTCCAGTAT GGTAGCGCGG GTTTCTGTCT CAAAAG6AGG TAGACGATCC 
GGAGGTCATA CCATCGCGCC CAAAGACAGA GTTTTCCTCC ATCTGCTA6G 

32651 CTACTGTACG GAGTGCGCCG AGACAACCGA GATCGTGTTG GTCGTAGTGT 
GATGACATGC CTCACGCGGC TCTGTTGGCT CTAGCACAAC CAGCATCACA 

32701 CATGCCAAAT GGAACGCCGG ACGTAGTCAT ATTTCCTGAA GCAAAACCAG 
GTACGGTTTA CCTTGCGGCC TGCATCAGTA TAAAGGACTT CGTTTTGGTC 

32751 GTGCGGGCGT GACAAACAGA TCTGCGTCTC CGGTCTCGCC GCTTAGATCG 
CACGCCCGCA CTGTTTGTCT AGACGCAGAG GCCAGAGCGG CGAATCTAGC 

32BQ1 CTCTGTGTAG TAGTTGTAGT ATATCCACTC TCTCAAAGCA TCCAGGCGCC 
GAGACACATC ATCAACATCA TATAGGTGAG AGAGTTTCGT AGGTCCGCGG 

32B51 CCCTGGCTTC GGGrPCTATG TAAACTCCTT CATGCGCCGC TGCCCTGATA 
GGGACC6AAG CCCAAGATAC ATTTGAGGAA GTACGCGGCG AC6GGACTAT 

32901 ACATCCACCA CCGCAGAATA AGCCTVCACCC AGCCAACCTA CACATTCGTT 
TGTAGGTGGT GGCGTCTTAT TCGGTGTGGG TCGGTTGGAT GTGTAAGCAA 

32951 CTGC6AGTCA CACAC6GGAG GAGCGGGAAG AGCTGGAAGA ACCATGTTTT 
GACCCTCAGT GTGTGCCCTC CTCGCCCTTC TCGACCTTCT TGGTACAAAA 

33001 TTTTTTTATT CCAAAAGATT ATCCAAAACC TCAAAATGAA GATCTATTAA 
AAAAAAATAA GGTTTTCTAA TAGGTTTTGG AGTTTTACTT CTAGATAATT 

33051 GTGAACGCGC TCCCCTCCGG TGGCGTGGTC AAACTCTACA GCCAAAGAAC 
CACTTGCGCG AGGGGAGGCC ACCGCACCAG TTTGAGATGT CGGTTTCTTG 

33101 AGATAATGGC ATTTGTAAGA TGTTGCACAA TGGCTTCCAA AAGGCAAACG 
TCTATTACCG TAAACATTCT ACAACGTGTT ACCGAAGGTT TTCCGTTTGC 
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33151 GCCCTCACGTl^fi-j^'STGGAC GTAAAGGCTA AACCCTTC-SG •eGTOJMi^ 
CGG6A6TGCA ^McACCTG CATTTCC6AT TT6GGAA6TC CCACUJMiG 

33201 CTCTATAAAC ATTCCAGCAC CTTCAACCAT GCCCAAATAA TTCTCATCTC 
6AGATATTTG TAAGGTC6TG GAAGTTGGTA CGG6TTTATT AAGAGTAGAG 

33251 GCCACCTTCT CAATATATCT CTAAGCAAAT CCCGAATATT AAGTCCGGCC 
CGGTGGAAGA GTTATATAGA GATTCGTTTA GGGCTTATAA TTCAGGCCGG 

33301 ATTGTAAAAA TCTGCTCCAG AGCGCCCTCC ACCTTCAGCC TCAAGCAGCG 
TAACATTTTT AGAC6A6GTC TCGCGGGAGG TG6AAGTCGG AGTTCGTCGC 

33351 AATCATGATT GCAAAAATTC AGGTTCCTCA CAGACCTGTA TAAGATTCAA 
TTAGTACTAA CGTTTTOAAG TCCAAGGAGT GTCTGGACAT ATTCTAAGTT 

33401 AAGCGGAACA TTAACAAAAA TACCGCGATC CCGTAGGTCC CTTCGCAGGG 
TTC6CCTTGT AATTGTTTTT ATGGCGCTAG GGCATCCAGG GAAGCGTCCC 

33451 CCAGCTGAAC ATAATCGTGC AGGTCTGCAC GGACCAGCGC 6GCCACTTCC 
GGTCGACTTG TATTAGCACG TCCAGAC6TG CCTGGTCGCG CCGGTGAAGG 

33S01 CCGCCAGGAA CCATGACAAA AGAACCCACA CTGATTATGA CACGCATACT 
GGC6GTCCTT GGTACTGTTT TCTTGGGTGT GACTAATACT GTGCGTATGA 

33551 CGGAGCTATG CTAACCAGCG TAGCCCCGAT GTAAGCTTGT TGCATGGGCG 
GCCTCGATAC GATTGGTCGC ATCGGGGCTA CATTCGAACA ACGTACCCGC 

33601 GCGATATAAA ATGCAAGGTG CTGCTCAAAA AATCAGGCAA AGCCTCGCGC 
CGCTATATTT TACGTTCCAC GACGAGTTTT TTAGTCCGTT TCGGAGCGCG 

33651 AAAAAAGAAA GCACATCGTA GTCATGCTCA TGCAGATT^ GGCAGGTAAG 
TTTTTTCTTT CGTGTAGCAT CAGTACGAGT ACGTCTATTT CCGTCCATTC 

33701 CTCCGGAACC ACCACAGAAA AAGACACCAT TTTTCTCTCA AACATGTCTG 
GAGGCCTTGG TGGTGTCTTT TTCTGT6GTA AAAAGAGAGT TTGTACAGAC 

33751 CGGGTTTCTG CATAAACACA AAATAAAATA ACAAAAAAAC ATTTAAACAT 
GCCCAAAGAC GTATTTGTGT TTTATTTTAT TGTTTTTTTG TAAATTTGTA 

3380i TAGAAGCCTG TCTTACAACA GGAAAAACAA CCCTTATAAG CATAAGACGG 
ATCTTCGGAC AGAATGTTGT CCTTTTTGTT GGGAATATTC GTATTCTGCC 

33851 ACTACGGCCA TGCCGGCGTG ACCGTAAAAA AACTGGTCAC CGTGATTAAA 
TGATGCCGGT ACGGCCGCAC TGGCATTTTT TTGACCAGTG GCACTAATTT 

33901 AAGCACCACC GACAGCTCCT CGGTCATGTC CGGAGTCATA ATGTAAGACT 
TTCGTGGTGG CTGTCGAGGA GCCAGTACAG GCCTCA6TAT TACATTCTGA 

33951 CGGTAAACAC ATCAGGTTGA TTCACATCGG TCAGTGCTAA AAAGCGACCG 
GCCATTTGTG TAGTCCAACT AAGTGTAGCC AGTCACGATT TTTCGCTGGC 

34001 AAATAGCCCG GGGGAATACA TACCCGCAGG CGTAGAGACA ACATTACAGC 
TTTATCGGGC CCCCTTATGT ATGGGCGTCC GCATCTCTGT TGTAATGTCG 

34051 CCCCATAGGA GGTATAACAA AATTAATAGG AGAGAAAAAC ACATAAACAC 
GGGGTATCCT CCATATTGTT TTAATTATCC TCTCTTTTTG TGTATTT6TG 
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34X01 CTGAAAAACCiCSfiSTGCCTA GGCAAAATAG CACCCTCC^'^TQGAtiSdUS^ 
GACTTTTTGG GJ^BACGGAT CCGTTTTATC GT6GGAGGGC GAGGTC^T 

34151 ACATACAGCG CTTCCACAQC GGCAGCCATA ACAGTCAGCC TTACCAGTAA 
TGTATGTCGC GAAGGTGTCG CCGTCGGTAT TGTCAGTCGG AATGGTCATT 

34201 AAAAGAAAAC CTATTAAAAA AACACCACTC GACACGGCAC CAGCTCAATC 
TTTTCTTTTG GATAATTTTT TTGTGGTGAG CTGTGCCGTG GTCGAGTTAG 

34251 AGTCACAGTG TAAAAAAGGG CCAAGTGCAG AGCGAGTATA TATAGGACTA 
TCAGTGTCAC ATTTTTTCCC GGTTCACGTC TCGCTCATAT ATATCCTGAT 

34301 AAAAAT6ACG TAACGGTTAA AGTCCACAAA AAACACCCAG AAAACC6CAC 
TTTTTACTGC ATT6CCAATT TCAG6TGTTT TTTGTGGGTC TTTTGGCGTG 

34351 GCGAACCTAC GCCCAGAAAC 6AAAGCCAAA AAACCCACAA CTTCCTCAAA 
CGCTTGGATG CGGGTCTTTG CTTTCGGTTT TTTGGGTGTT GAAGGAGTTT 

34401 TCGTCACTTC CGTTTTCCCA CGTTACGTCA CTTCCCATTT TAAGAAAACT 
AGCAGTGAAG GCAAAAGGGT GCAATGCA6T GAAGGGTAAA ATTCTTTTGA 

34451 ACAATTCCCA ACACATACAA GTTACTCCGC CCTAAAACCT ACGTCACCCG 
TGTTAAGGGT TGTGTATGTT CAATGAGGCG GGATTTTGGA TGCAGTGGGC 

34501 CCCCGTTCCC ACGCCCCGCG CCACGTCACA AACTCCACCC CCTCATTATC 
GGGGCAAGGG TGCGGGGCGC GGTGCAGTGT TTGAGGTGGG GGAGTAATAG 

Pad 



3455a ATATTGGCTT CAATCCAAAA TAAGGTATAT TATTGATGAT GTTAATTAAG 
TATAACCGAA GTTAGGTTTT ATTCCATATA ATAACTAGTA CAATTAATTC 

34601 AATTCGGATC TGCGACGCGA GGCTGGATGG CCTTCCCCAT TATGATTCTT 
TTAAGCCTAG ACGCTGCGCT CCGACCTACC GGAAGGGGTA ATACTAAGAA 

34651 CTCGCTTCCG GCGGCATCGG GATGCCCGCG TTGCAGGCCA TGCTGTCCAG 
GAGCGAAGGC CGCCGTAGCC CTACGGGCGC AACGTCCGGT ACGACAGGTC 

34701 GCAGGTAGAT GACGACCATC AGGGACAGCT TCAA6GCCAG CAAAAGGCCA 
CGTCCATCTA CTGCTGGTAG TCCCTGTCGA AGTTCCGGTC GTTTTCCGGT 

34751 GGAACCGTAA AAAGGCCGCG TT6CTGGC6T TTTTCCATAG GCTCCGCCCC 
CCTTGGCATT TTTCCGGCGC AACGACCGCA AAAAGGTATC CGAGGCGGGG 

34801 CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC 
GGACTGCTCG TAGTGTTTTT AGCTGCGAGT TCAGTCTCCA CCGCTTTOGG 

34B51 GACAGGACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC 
CTGTCCTGAT ATTTCTATGG TCCGCAAAGG GGGACCTTCG AGGGAGCACG 

34901 GCTCTCCTGT TCCGACCCTG CCGCTTACCG GATACCTGTC CGCCTTTCTC 
CGAGAGGACA AGGCTGGGAC GGCGAATGGC CTATGGACAG GCGGAAAGAG 

34951 CCTTCGGGAA GCGTGGCGCT TTCTCATAGC TCAC6CTGTA GGTATCTCAG 
GGAAGCCCTT CGCACCGCGA AAGAGTATCG AGTGCGACAT CCATAGAGTC 

35001 TTCGGTGTAG 6TCGTTC6CT CCAAGCTGGG CTGTGTGCAC GAACCCCCCG 
AAGCCACATC CAGCAAGCGA GGTTCGACCC GACACACGTG CTTGGGGGGC 
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35051 TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCAAC 
AAGTCGGGCT GGCGACGCGG AATAGGCCAT TGATAGCAGA* ACTCAGGTTG 

35101 CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT 
GGCCATTCTG TGCTGAATAG CGGTGACCGT CGTC6GTGAC CATTGTCCTA 

35151 TAGCAGA6CG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC 
ATCGTCTCGC TCCATACATC CGCCACGATG TCTCAAGAAC TTCACCACCG 

35201 CTAACTACGG CTACACfAGA AGGACAGTAT TTGGTATCTG CGCTCTGCTG 
GATTGATGCC GATGTGATCT TCCTGTCATA AACCATAGAC GCGAGACGAC 

35251 AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT CCGGCAAACA 
TTCGGTCAAT GGAAGCCTTT TTCTCAACCA TCGAGAACTA 6GCCGTTTGT 

35301 AACCACCGCT GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG CAGATTACGC 
TTGGTGGCGA CCATCGCCAC CAAAAAAACA AACGTTCGTC GTCTAATGCG 

35351 GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC TACGGGGTCT 
CGTCTTTTTT TCCTAGAGTT CTTCTAGGAA ACTAGAAAAG ATGCCCCAGA 

35401 GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT 
CTGCGAGTCA CCTT6CTTTT GAGTGCAATT CCCTAAAACC AGTACTCTAA 

35451 ATCAAAAAGG ATCTTCACCT AGATCCTTTT AAATCAATCT AAAGTATATA 
TAGTTTTTCC TAGAAGT6GA TCTAGGAAAA TTTAGTTAGA TTTCATATAT 

35501 TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA 
ACTCATTTGA ACCAGACTGT CAATGGTTAC GAATTAGTCA CTCCGTGGAT 

■^5551 TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG ACTCCCCGTC 
AGAGTCGCTA GACAGATAAA GCAAGTAGGT ATCAACGGAC TGAGGGGCAG 

35601 GTGTAGATAA CTACGATACG GGA6GGCTTA CCATCTGGCC CCAGTGCTGC 
CACATCTATT GATGCTATGC CCTCCCGAAT GGTAGACCGG GGTCACGACG 

35651 AATGATACCG CGAGACCCAC GCTCACCGGC TCCAGATTTA TCAGCAATAA 
TTACTATGGC GCTCTGGGTG CGA6TGGCCG AGGTCTAAAT AGTCGTTATT 

35701 ACCAGCCAGC CGGAAGG6CC GAGCGCA6AA GTGGTCCTGC AACTTTATCC 
TGGTCGGTCG GCCTTCCCGG CTCGCGTCTT CACCAGGACG TTGAAATAGG 

35751 GCCTCCATCC AGTCTATTAA TTGTTGCC6G GAAGCTAGAG TAAGTAGTTC 
CGGAGGTAGG TCAGATAATT AACAACGGCC CTTCGATCTC ATTCATCAAG 

35801 GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA QGCATCGTGG 
CG6TCAATTA TCAAACGCGT TGCAACAACG GTAACGATGT CCGTA6CACC 

35851 TGTCACGCTC GTCGTTTGGT ATGGCTTCAT TCAGCTCCGG TTCCCAACGA 
ACAGTGCGAG CAGCAAACCA TACCGAAGTA AGTCGAGGCC AAGGGTTGCT 

35901 TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC 
AGTTCCGCTC AATGTACTAG GGGGTACAAC ACGTTTTTTC GCCAATCGAG 

35951 CTTCGGTCCT CCGATCGTTG TCAGAAGTAA GTTGGCCGCA GTGTTATCAC 
GAAGCCAGGA GGCTAGCAAC AGTCTTCATT CAACCGGCGT CACAATAGTG 
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36001 TCATGGTTAT^WP^GCACTG CATAATTCTC TTACTGTCAT GCCAT^PtA 
AGTACCAATA CCGTCGTGAC GTATTAA6AG AATGACAGTA CGGTAGGCAT 

36051 AGATGCTTTT CTGTGACTGG TGAGTACTCA ACCAAGTCAT. TCTGAGAATA 
TCTACGAAAA GACACTGACC ACTCATGAGT TGGTTCAGTA AGACTCTTAT 

36101 GTGTATGCGG CGACCGAGTT GCTCTTGCCC GGCGTCAACA CGGGATAATA 
CACATACGCC GCTOGCTCAA CGAGAACGGG CCGCAGTTGT GCCCTATTAT 

36151 CCGCGCCACA TAGCA6AACT TTAAAAGTGC TCATCATTGG AAAACGTTCT 
GGCGCGGTGT ATCGTCTTGA AATTTTCACG AGTAGTAACC TTTTGCAAGA 

36201 TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTTGAGAT CCAGTTCGAT 
AGCCCCGCTT TT6AGAGTTC CTAGAATGGC GACAACTCTA G6TCAAGCTA 

36251 GTAACCCACT CGTGCACCCA ACTGATCTTC AGCATCTTTT ACTTTCACCA 
CATTGGGTGA 6CACGTGGGT TGACTAGAAG TCGTAGAAAA TGAAAGTGGT 

36301 • GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC AAAATGCCGC AAAAAAGGGA 
CGCAAAGACC CACTCGTTTT TGTCCTTCCG TTTTACGGCG TTTTTTCCCT 

36351 ATAAG6GCGA CACGGAAATG TTGAATACTC ATACTCTTCC TTTTTCAATA 
TATTCCCGCT GTGCCTTTAC AACTTATGAG TATGAGAAGG AAAAAGTTAT 

36401 TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGAGCGGA TACATATTTG 
AATAACTTCG TAAATAGTCC CAATAACAGA GTACTCGCCT ATGTATAAAC 

36451 AATGTATTTA GAAAAATAAA CAAATAGGGG TTCCGCGCAC ATTTCCCCGA 
TTACATAAAT CTTTTTATTT GTTTATCCCC AAGGCGCGTG TAAAGGGGCT 

36501 AAAGTGCCAC CTGACGTCTA AGAAACCATT ATTATCATGA CATTAACCTA 
TTTCACGGTG GACTGCAGAT TCTTTGGTAA TAATAGTACT GTAATTG(3AT 

36551 TAAAAATAGG CGTATCACGA GGCCCTTTCG TCTTCAAGAA TTGGATCCGA 
ATTTTTATCC GCATAGTGCT CCGGGAAA6C AGAAGTTCTT AACCTAGGCT 



Pad 



36601 



ATTCTTAATT 
TAAGAATTAA 



TCTTAATTAA 
AGAATTAATT 



(SEQ ZD NO:34) 
(SEQ ID NO: 35) 
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AEX-HPLC Cell-Associated Viral Particle Concentration 
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atg gat gca atg aag aga ggg etc tgc tgt gtg ctg ctg ctg tgt gga 48 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 

gca gtc ttc gtt teg ccc age gag ate tee att gtg tgg gee tec agg 96 
Ala Val Phe Val Ser Pro Ser Glu lie Ser He Val Trp Ala Sex Arg 
20 25 30 

gag ctg gag agg ttt get gtg aac cct ggc ctg ctg gag ace tct gag 144 
Glu Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser Glu 
35 40 45 

ggg tgc agg cag ate ctg ggc cag etc cag ccc tec ctg caa aca ggc 192 
Gly Cys Arg Gin He Leu Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly 
50 55 60 

tct gag gag ctg agg tec ctg tac aac aca gtg get ace ctg tac tgt 240 
Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu Tyr Cys 
65 70 75 80 

gtg eac cag aag att gat gtg aag gac acc aag gag gee ctg gag aag 288 
val His Gin Lys He Asp Val Lys Asp Thr Lys Glu Ala Leu Glu Lys 
85 90 95 

att gag gag gag cag aac aag tee aag aag aag gee cag cag get get 336 
He Glu Glu Glu Gin Asn Lys Ser Lys Lys Lys Ala Gin Gin Ala Ala 
100 105 110 

get ggc aca ggc aac tec age cag gtg tec cag aac tac ccc att gtg 384 
Ala Gly Thr Gly Asn Ser Ser Gin Val Ser Gin Asn Tyr Pro He Val 
115 120 125 

cag aac etc cag ggc cag atg gtg eac cag gcc ate tec ccc egg acc 432 
Gin Asn Leu Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg Thr 
130 135 140 

ctg aat gcc tgg gtg aag gtg gtg gag gag aag gcc ttc tec cct gag 480 
Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu 
145 150 155 160 

gtg ate ccc atg ttc tct gcc ctg tct gag ggt gcc acc ccc cag gac 528 
Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp 
165 170 175 

ctg aac acc atg ctg aac aca gtg ggg ggc cat cag get gcc atg cag 576 
Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met Gin 
180 185 190 



atg ctg aag gag acc ate aat gag gag get get gag tgg gac agg ctg 
Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg Leu 
195 200 205 



agg ggc tct gac att get ggc acc acc tec acc etc cag gag cag att 
Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin He 
225 230 235 240 



624 



cat cct gtg cac get ggc ccc att gcc ccc ggc cag atg agg gag ccc 672 
His Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu Pro 
210 215 220 



720 



ggc tgg atg acc aac aac ccc ccc ate cct gtg ggg gaa ate tac aag 768 
Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr Lys 
245 250 255 
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agg tgg ate ate ctg ggc ctg aac aag att gtg agg atg tac tec ccc 616 
Arg Trp lie lie Leu Gly Leu Asn Lys lie Val Arg Met Tyr Ser Pro 
260 265 270 

ace tec ate ctg gac ate agg cag ggc ccc aag gag ccc ttc agg gac 864 
Thr Ser lie Leu Asp lie Arg Gin Gly Pro Lys Glu Pro Phe Arg Asp 
275 280 285 

tat gtg gac agg ttc tac aag acc ctg agg get gag cag gee tec cag 912 
Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Ser Gin 
290 295 300 

gag gtg aag aac tgg atg aca gag acc ctg ctg gtg cag aat gee aac 960 
Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn 
305 310 315 320 

cct gac tgc aag acc ate ctg aag gee ctg ggc cct get gcc acc ctg 1008 
Pro Asp Cys Lys Thr lie Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu 
325 330 335 

gag gag atg atg aca gcc tgc cag ggg gtg ggg ggc cct ggt cac aag 1056 
Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys 
340 345 350 

gcc agg gtg ctg get gag gcc atg tec cag gtg acc aac tec gcc acc 1104 
Ala Arg Val Leu Ala Glu Ala Met Ser Gin Val Thr Asn Ser Ala Thr 
355 360 365 

ate atg atg cag agg ggc aac ttc agg aac cag agg aag aca gtg aag 1152 
lie Met Met Gin Arg Gly Asn Phe Arg Asn Gin Arg Lys Thr Val Lys 
370 375 380 

tgc ttc aac tgt ggc aag gtg ggc cac att gcc aag aac tgt agg gee 1200 
Cys Phe Asn Cys Gly Lys Val Gly His lie Ala Lys Asn Cys Arg Ala 
'oe • 395 400 

ccc agg aag aag ggc tgc tgg aag tgt ggc aag gag ggc cac cag atg 1248 
Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly His Gin Met 
405 410 415 

aag gac tgc aat gag agg cag gcc aac ttc ctg ggc aaa ate tgg ccc 1296 
Lys Asp Cys Asn Glu Arg Gin Ala Asn Phe Leu Gly Lys He Trp Pro 
420 425 430 

tec cac aag ggc agg cct ggc aac ttc etc cag tec agg cct gag ccc 1344 
Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gin Ser Arg Pro Glu Pro 
435 440 445 

aca gcc cct ccc gag gag tec ttc agg ttt ggg gag gag aag acc ace 1392 
Wir Ala Pro Pro Glu Glu Ser Phe Arg Phe Gly Glu Glu Lys Thr Thr 
450 455 460 

ccc age cag aag cag gag ccc att gac aag gag ctg tac ccc ctg gcc 1440 
Pro Ser Gin Lys Gin Glu Pro lie Asp Lys Glu Leu Tyr Pro Leu Ala 
465 470 475 480 . 

tec ctg agg tec ctg ttt ggc aac gac ccc tec tec cag taa (SID NO: 36) 1482 
Ser Leu Arg Ser Leu Phe Gly Asn Asp Pro Ser Ser Gin * (SID NO: 37) 
485 490 
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FIGURE 33A 

ATGGGTGCTA GGGCTTCTGT GCTGTCTGGT GGTGAGCTGG ACAAGTGGGA GAAGATCAGG 
CTGAGGCCTG GTGGCAAGAA GAAGTACAAG CTAAAGCACA TTGTGTGGGC CTCCAGGGAG 
CTGGAGAGGT TTGCTGTGAA CCCTG6CCTG CTGGAGACCT CTGAGGGGTG CAGGCAGATC 
CTGGGCCAGC TCCAGCCCTC CCTGCAAACA GGCTCTGAGG AGCTGAGGTC CCTGTACAAC 
ACAGTGGCTA CCCTGTACTG TGTGCACCAG AAGATTGATG TGAAGGACAC CAAGGAGGCC 
CTG6AGAAGA TTGAGGAGGA GCAGAACAAG TCCAA6AAGA AGGCCCAGCA GGCTGCTGCT 
GGCACAGGCA ACTCCAGCCA GGTGTCCCAG AACTACCCCA TTGTGCAGAA CCTCCAGGGC 
CAGATGGTGC ACCAGGCCAT CTCCCCCCGG ACCCTGAATG CCTGGGTGAA GGTGGTGGAG 
GAGAAGGCCT TCTCCCCTGA GGTGATCCCC ATGTTCTCTG CCCTGTCTGA GGGTGCCACC 
CCCCAGGACC TGAACACCAT GCTGAACACA GTGGGGGGCC ATCAGGCTGC CATGCAGATG 
CTGAAGGAGA CCATCAATGA GGAGGCTGCT GAGTGGGACA GGCTGCATCC TGTGCACGCT 
GGCCCCATTG CCCCCGGCCA GATGAGGGAG CCCAGGGGCT CTGACATTGC TGGCACCACC 
TCCACCCTCC AGGAGCAGAT TGGCTGGATG ACCAACAACC CCCCCATCCC TGTGGGGGAA 
ATCTACAAGA GGTGGATCAT CCTGGGCCTG AACAAGATTG TGAGGATGTA CTCCCCCACC 
TCCATCCTGG ACATCAGGCA GGGCCCCAAG GAGCCCTTCA GGGACTATGT GGACAGGTTC 
TACAAGACCC TGAGGGCTGA GCAGGCCTCC CAGGAGGTGA AGAACTGGAT GACAGAGACC 
CTGCTGGTGC AGAATGCCAA CCCTGACTGC AAGACCATCC TGAAGGCCCT GGGCCCTGCT 
GCCACCCTGG AGGAGATGAT GACAGCCTGC CAGGGGGTGG GG6GCCCTGG TCACAAGGCC 
AGGGTGCTGG CTGAGGCCAT GTCCCAGGTG ACCAACTCCG CCACCATCAT GATGCAGAGG 
GGCAACTTCA GGAACCAGAG GAAGACAGTG AAGTGCTTCA ACTGTGGCT^ GGTGGGCCAC 
ATTGCCAAGA ACTGTAGGGC CCCCAGGAAG ■ AAGGGCTGCT GGAAGTGTGG CAAGGAGGGC 
CACCAGATGA AGGACTGCAA TGAGAGGCAG GCCAACTTCC TGGGCAAAAT CTGGCCCTCC 
CACAAGGGCA GGCCTGGCAA CTTCCTCCAG TCCAGGCCTG AGCCCACAGC CCCTCCCGAG 
GAGTCCTTCA GGTTTGGGGA GGAGAAGACC ACCCCCAGCC AGAAGCAGGA GCCCATTGAC 
AAGGAGCTGT ACCCCCTGGC CTCCCT6AGG TCCCTGTTTG GCAACGACCC CTCCTCCCAG 
ATGGCTCCCA TCTCCCCCAT TGAGACTGTG CCTGTGAAGC TGAAGCCTGG CATGGATGGC 
CCCAAGGTGA AGCAGTGGCC CCTGACTGAG GAGAAGATCA AGGCCCTGGT GGAAATCTGC 
ACTGAGATGG AGAAGGAGGG CAAAATCTCC AAGATTGGCC CCGAGAACCC CTACAACACC 
CCTGTGTTTG CCATCAAGAA GAAGGACTCC ACCAAGT6GA GGAAGCTGGT GGACTTCAGG 
GAGCTGAACA AGAGGACCCA GGACTTCTGG GAGGTGCAGC TGGGCATCCC CCACCCCGCT 
GGCCTGAAGA AGAAGAAGTC TGTGACTGTG CTGGCTGTGG GGGATGCCTA CTTCTCTGTG 
CCCCTGGATG AG6ACTTCAG GAAGTACACT GCCTTCACCA TCCCCTCCAT CAACAATGAG 
ACCCCTGGCA TCAGGTACCA GTACAATGTG CTGCCCCAG6 GCTGGAAGGG CTCCCCTGCC 
ATCTTCCA6T CCTCCATGAC CAAGATCCTG GAGCCCTTCA GGAAGCAGAA CCCTGACATT 
GTGATCTACC AGTACATGGC TGCCCTGTAT GTGGGCTCTG ACCTGGAGAT TGGGCAGCAC 
AGGACCAAGA TTGAGGAGCT GAGGCAGCAC CTGCTGAGGT GGGGCCTGAC CACCCCTGAC 
AA6AAGCACC AGAAGGAGCC CCCCTTCCTG TGGATG6GCT ATGAGCTGCA CCCCGACAAG 
TGGACTGTGC A6CCCATTGT GCTGCCTGAG AAGGACTCCT GGACTGTGAA TGACATCCAG 
AAGCTGGTGG GCAAGCTGAA CTGGGCCTCC CAAATCTACC CTGGCATCAA GGTGAGGCAG 
CTGTGCAAGC TGCTGAGGGG CACCAAGGCC CTGACTGAGG TGATCCCCCT GACTGAGGAG 
GCTGAGCTGG AGCTGGCTGA GAACAGGGAG ATCCTGAAGG AGCCTGTGCA TGGGGTGTAC 
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TATGACCCCT CCAAGGACCT 
TACCAAATCT ACCAGGAGCC 
GGGGCCCACA CCAATGATGT 
TCCATTGTGA TCTGGGGCAA 
GAGACCTGGT GGACTGAGTA 
ACCCCCCCCC TGGTGAAGCT 
ACCTTCTATG TGGCTGGGGC 
ACCAACAGGG GCAGGCAGAA 
CTCCAGGCCA TCTACCTGGC 
TCCCAGTATG CCCTGGGCAT 
AACCAGATCA TTGAGCAGCT 
CACAAGGGCA TTGGGGGCAA 
GTGCTGTTCC TGGATGGCAT 
TGGAGGGCTA TGGCCTCTGA 
TCCTGTGACA AGTGCCAGCT 
GGCATCTGGC AGCTGGCCTG 
GTGGCCTCCG GCTACATTGA 
TACTTCCTGC TGAAGCTGGC 
TCCT^CTTCA CTGGG6CCAC 
TT" ^r\TCC CCTACAACCC 
AAGAAGATCA TTGGGCAGGT 
GCTGTGTTCA TCCACAACTT 
AGGATTGTGG ACATCATTGC 
AAGATCCAGA ACTTCAGGGT 
GCCAAGCTGC TGTGGAAGGG 
GTGGTGCCCA GGAGGAAGGC 
GACTGTGTGG CCTCCAGGCA 
S£Q ID NO: 38 



FIGURE 33B 

GATTGCTGAG ATCCAGAAGC 
CTTCAAGAAC CTGAAGACTG 
GAAGCAGCTG ACTGAGGCTG 
GACCCCCAAG TTCAAGCTGC 
CTGGCAGGCC ACCTGGATCC 
GTGGTACCAG. CTGGAGAAGG 
TGCCAACAGG GAGACCAAGC 
6GTGGTGACC CTGACTGACA 
CCTCCAGGAC TCTGGCCTGG 
CATCCAGGCC CAGCCTGATC 
GATCAAGAAG 6AGAAGGTGT 
TGAGCAGGTG GACAAGCTGG 
TGACAAGGCC CAGGATGAGC 
CTTCAACCTG CCCCCTGTGG 
GAAGGGGGAG GCCATGCATG 
CACCCACCTG GAGGGCAAGG 
GGCTGAGGTG ATCCCTGCTG 
TGGCAGGTGG CCTGTGAAGA 
AGTGAGGGCT GCCTGCTGGT 
CCAGTCCCAG GGGGTGGTGG 
GAGGGACCAG GCTGAGCACC 
CAAGAGGAAG GGGGGCATCG 
CACAGACATC CAGACCAAGG 
GTACTACAGG GACTCCAGGA 
GGAGGGGGCT GTGGTGATCC 
CAAGATCATC AGGGACTATG 
6GATGA6GAC TAA 



AGGGCCAGGG CCAGTGGACC 
GCAAGTATGC CAGGATGAGG 
TGCAGAAGAT CACCACTGAG 
CCATCCAGAA GGAGACCTGG 
CTGAGTGGGA GTTTGTGAAC 
AGCCCATTGT GGGGGCTGAG 
•TGGGCAAGGC TGGCTATGTG 
CCACCAACCA GAAGACT6CC 
AGGTGAACAT TGTGACTGCC 
AGTCTGAGTC TGAGCTGGTG 
AGCTGGCCTG GGTGCCTGCC 
TGTCTGCTGG CATCAGGAAG 
ATGAGAAGTA CCACTCCAAC 
TGGCTAAGGA GATTGTGGCC 
GGCAGGTGGA CT6CTCCCCT 
TGATCCTGGT 6GCTGTGCAT 
AGACAGGCCA GGAGACTGCC 
CCATCCACAC TGCCAATGGC 
GGGCTGGCAT CAAGCA6GAG 
CCTCCATGAA CAAGGAGCTG 
TGAAGACAGC TGTGCAGATG 
GGGGCTACTC CGCTGGGGAG 
AGCTCCAGAA GCAGATCACC 
ACCCCCTGTG GAAGGGCCCT 
AGGACAACTC TGACATCAAG 
GCAAGCAGAT G6CTGGG6AT 
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FIGURE 34A 

Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Lys Trp Glu Lys 
lie Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys His lie Val Trp 
Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser 
Glu Gly Cys Arg Gin He Leu Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly Ser 
Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu Tyr Cys Val His Gin 
Lys He Asp Val Lys Asp Thr Lys Glu Ala Leu Glu Lys He Glu Glu Glu Gin 
Asn Lys Ser Lys Lys Lys Ala Gin Gin Ala Ala Ala Gly Thr Gly Asn Ser Ser 
Gin Val Ser Glh Asn Tyr Pro He Val Gin Asn Leu Gin Gly Gin Met Val His 
Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys 
Ala Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr 
Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met 
Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg Leu His 
Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu Pro Arg Gly Ser 
Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr Asn 
Asn Pro Pro He Pro Val Gly Glu He Tyr Lys Arg Trp He He Leu Gly Leu 
Asn Lys He Val Arg Met Oyr Ser Pro Thr Ser He Leu Asp He Arg Gin Gly 
Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala 
Glu Gin Ala Ser Gin Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin 
Asn Ala Asn Pro Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr 
L? Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys Ala 
Arg Val Leu Ala Glu Ala Met Ser Gin Val Thr Asn Ser Ala Thr He Met Met 
Gin Arg Gly Asn Phe Arg Asn Gin Arg Lys Thr Val Lys Cys Phe Asn Cys Gly 
Lys Val Gly His He Ala Lys Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp 
Lys Cys Gly Lys Glu Gly His Gin Met Lys Asp Cys Asn Glu Arg Gin Ala Asn 
Phe Leu Gly Lys He Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gin 
Ser Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser Phe Arg Phe Gly Glu Glu 
Lys Thr Thr Pro Ser Gin Lys Gin Glu Pro He Asp Lys Glu Leu Tyr Pro Leu 
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Phe Trp Glu Val Gin Leu 


Gly 


He 


Pro His 


Pro 


Ala Gly 


Leu 


Lys 


Lys 


Lys 


Lys 


Ser Val Thr Val Leu Ala 


Val 


Gly Asp Ala 


Tyr 


Phe Ser 


Val 


Pro 


Leu 


Asp 


Glu 


Asp Phe Arg Lys Tyr Thr 


Ala 


Phe 


Thr He 


Pro 


Ser He 


Asn 


Asn 


Glu 


Thr 


Pro 


Gly He Arg Tyr Gin Tyr 


Asn 


Val 


Leu Pro 


Gin 


Gly Trp 


Lys 


Gly 


Ser 


Pro 


Ala 


He Phe Gin Ser Ser Met 


Thr Lys 


He Leu 


Glu 


Pro Phe 


Arg 


Lys 


Gin 


Asn 


Pro 


Asp He Val He Tyr Gin 


Tyr 


Met 


Ala Ala 


Leu 


Tyr Val 


Gly 


Ser 


Asp 


Leu 


Glu 


He Gly Gin His Arg Thr 


Lys 


He 


Glu Glu 


Leu 


Arg Gin 


His 


Leu 


Leu 


Arg 


Trp 


Gly Leu Thr Thr Pro Asp 


Lys 


Lys His Gin Lys 


Glu Pro 


Pro 


Phe 


Leu Trp 


Met Gly Tyr Glu Leu His Pro 
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Asp Lys 


Trp 


Thr 


Val 


Gin 


Asn Asp 


He 


Gin 


Lys 


Leu 


Gly 


He 


Lys 


Val 


Arg 


Gin 


Glu 


Val 


He 


Pro 


Leu 


Thr 


He 


Leu 


Lys 


Glu 


Pro Val 


Ala 


Glu 


He 


Gin 


Lys 


Gin 


Pro 


Phe 


Lys 


Asn 


Leu 


Lys 


Asn 


Asp 


Val 


Lys 


Gin 


Leu 


Val 


He 


Trp 


Gly 


Lys 


Thr 


Glu 


Thr 


Trp 


Trp 


Thr 


Glu 


Val 


Asn 


Thr 


Pro 


Pro 


Leu 


Val 


Gly 


Ala 


Glu 


Thr 


Phe 


Gly Lys 


Ala 


Gly 


Tyr 


Val 


Asp 


Thr 


Thr 


Asn 


Gin 


Lys 


Ser 


Gly 


Leu 


Glu 


Val 


Asn 


Gin 


Ala 


Gin 


Pro 


Asp 


Gin 


Leu 


He 


Lys 


Lys 


Glu 


Lys 


Gly Gly 


Asn 


Glu 


Gin 


Val 


Phe 


Leu 


Asp 


Gly 


He 


Asp 


Tro 


Arg 


Ala 


Met 


Ala 


Ser 


Vaj. 


Ala 


Ser 


Cys 


Asp 


Lys 


Asp 


Cys 


Ser 


Pro 


Gly 


He 


He 


Leu 


Val 


Ala 


Val 


His 


Ala 


Glu 


Thr 


Gly 


Gin 


Glu 


Pro 


Val 


Lys 


Thr 


He 


His 


Arg 


Ala 


Ala 


Cys 


Trp 


Trp 


Pro 


Gin 


Ser 


Gin 


Gly Val 


Gly Gin 


Val 


Arg 


Asp 


Gin 


Phe 


He 


His 


Asn 


Phe 


Lys 


Arg 


He 


Val 


Asp 


He 


He 


He 


Thr 


Lys 


He 


Gin 


Asn 


Trp 


Lys 


Gly 


Pro 


Ala 


Lys 


Asp Asn 


Ser 


Asp 


He Lys 


Tyr Gly 


Lys 


Gin 


Met 


Ala 



SEQ ID NO: 39 



FIGURE 34B 

Pro He Val Leu Pro Glu 
Val Gly Lys Leu Asn Trp 
Leu Cys Lys Leu Leu Arg 
Glu Glu Ala Glu Leu Glu 
His Gly Val Tyr Tyr Asp 
Gly Gin Gly Gin Trp Thr 
Thr Gly Lys Tyr Ala Arg 
Thr Glu Ala Val Gin Lys 
Pro Lys Phe Lys Leu Pro 
Tyr Trp Gin Ala Thr Trp 
Val Lys Leu Trp Tyr Gin 
Tyr Val Ala Gly Ala Ala 
Thr Asn Arg Gly Arg Gin 
Thr Ala Leu Gin Ala He 
He Val Thr Ala Ser Gin 
Ser Glu Ser Glu Leu Val 
Val Tyr Leu Ala Trp Val 
Asp Lys Leu Val Ser Ala 
Lys Ala Gin Asp Glu His 
Asp Phe Asn Leu Pro Pro 
Cys Gin Leu Lys Gly Glu 
Trp Gin Leu Ala Cys Thr 
Val Ala Ser Gly Tyr He 
Thr Ala Tyr Phe Leu Leu 
Thr Ala Asn Gly Ser Asn 
Ala Gly He Lys Gin Glu 
Val Ala Ser Met Asn Lys 
Ala Glu His Leu Lys Thr 
Arg Lys Gly Gly Jle Gly 
Ala Thr Asp He Gin Thr 
Phe Arg Val Tyr Tyr Arg 
Leu Leu Trp Lys Gly Glu 
Val Val Pro Arg Arg Lys 
Gly Asp Asp Cys Val Ala 



Lys 


Asp 


Ser Trp Thr 


Val 


Ala 


Ser 


Gin He Tyr 


Pro 


Gly Thr 


Lys Ala Leu 


Thr 


Leu 


Ala 


Glu Asn Arg 


Glu 


Pro 


Ser 


Lys Asp Leu 


He 


Tyr 


Gin 


He Tyr Gin 


Glu 


Met 


Arg 


Gly Ala His 


Thr 


He 


Thr 


Thr Glu Ser 


He 


He 


Gin 


Lys Glu Thr 


Trp 


He 


Pro 


Glu Trp Glu 


Phe 


Leu 


Glu 


Lys Glu Pro 


He 


Asn 


Arg 


Glu Thr Lys 


Leu 


Lys 


Val 


Val Thr Leu 


Thr 


Tyr 


Leu 


Ala Leu Gin 


Asp 


Tyr 


Ala 


Leu Gly He 


He 


Asn 


Gin 


He He Glu 


Gin 


Pro 


Ala 


His Lys Gly 


He 


Gly 


He 


Arg Lys Val 


Leu 


Glu 


Lys 


Tyr His Ser 


Asn 


Val 


Val 


Ala Lys Glu 


He 


Ala 


Met 


His Gly Gin 


Val 


His 


Leu 


Glu Gly Lys 


Val 


Glu 


Ala 


Glu Val He 


Pro 


Lys 


Leu 


Ala Gly Arg 


Trp 


Phe* Thr 


Gly Ala Thr 


Val 


Phe 


Gly 


He Pro Tyr 


Asn 


Glu 


Leu 


Lys Lys He 


He 


Ala 


Val 


Gin Met Ala 


Val 


Gly Tyr 


Ser Ala Gly 


Glu 


Lys 


Glu 


Leu Gin Lys 


Gin 


Asp 


Ser 


Arg Asn Pro 


Leu 


Gly Ala 


Val Val He 


Gin 


Ala 


Lys 


He He Arg 


Asp 


Ser Arg Gin Asp Glu Asp 
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(57) Abstract: FirM generation adenoviral vectors and associated recombinant adcnoviais-hased HIV vaccines which show en- 
hanced stability and growth properties and greater cellular-mediated immunity are described within this specitlcaiion. These ade- 
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modificaiion thereof, inducing a cellular immune response which specitlcatly recogni'ices HIV-1. The exemplitled polynucleotides 
of the present invention are synthetic DNA molecules encoding HIV-1 Gag, encoding codon oplimized HlV-1 Pol. derivatives of 
optimized HIV- 1 Pol (including constructs wherein protease, reverse imnscriptase. RNAse H and integrase activity of HIV-1 Pol 
is inactivated). HIV-1 Nef and derivatives of optimized HIV-1 Nef. including nef mutants which clTccl wild type characlerisiics of 
Nef. such as myrisiylation and down regulation of host CD4. The adenoviral vaccines of the present invention, when administered 
alone or in a combined modality regime, will oiler a prophylactic advantage to previously uninfected individuals and/or provide 
a therapeutic etTeci by reducing viral load levels within an inlecled individual, thus prolonging the asymptomatic phase of HIV- 1 
infection. 
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particle. 

The special techmcal feature of group 5, 17 and 32 is considered to be a meihod of producing a cellular metfated immune res-ponse to 
the heterologous protein encoded by the diflerent adenoviral vectors. Each group contains different sequences a encoding different 
protein, therefore the resulting immune response will also be different. 

The special technical feature of group 6. 18 and 33 is considered to be a meihod of producing a ceUular mediated immune response to 
the heterologous protein encoded by the different adenoviral vectors in conjuncuon with immunizing the individual a DN A plasmid 
vaccine. Each method contains different sequences encoding a different protein, therefore the resulting immune response wUl also he 
different. 

Accordingly, groups 1-48 are not so linked by the same or corresponding technical feamrc as to form a single general inventive 
concept. 
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and AE3 , the vector contains ihe cis- acting packa^inK sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Pol protein (SEO ID NO: !) 
inserted in E I . 


14 


55 


The claim is directed to an adenoviral vector that is at least partially deleted of AE! 
and AE3 , Che vector contains the cis- acting packafiing seouence of the wild tVDe 
adenovirus genome, and a gene which encodes an HIV Pol protein (SEQ ID NO: 5) 
inserted in El. 


IS 


55 


The claim is directed to aii adenoviral vector that is at lieasi partially deleted of AEl 
and AE3 . the vector contains the cis- acting packagmg sec^uence of the wild type 
adenovirus genome, and a gene which encodes an HIV Pol protein (SEO ID NO; 7j 
inserted in El. 


16 


57-61 


The claims are directed to a method of making and harvesting of a recombinant 
adenoviral particle that contains a ^enc encoding an HIV Pot protein. 


17 


62, 65, 66 


The claim is directed to a method of generating a cellular mediated immune response 
to HIV Pol protein with the recombinant adenoviral particle. 


18 


63, 64 


The claim is directed to a method of generating a cellular mediated immune response 
to HIV Pol protein vfiih the recombinant adenoviral particle in addition to 
administering a DNA plasmid vaccine. 


19 


67-70, 72. 
73, 75 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl. the vector contains the cis-acting packaging sequence of the v/ild type 
adenovirus genome, and a gene vJhich encodes an HIV Nef protein (SEQ ID NO: 9) 
inserted in the parallel orientation of El . 


20 


67-70. 72. 
73. 75 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl , the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 1 1) 
inserted in the parallel orientation of E 1 . 


21 


67-70. 72, 
73. 75 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl, the vector contains the cis-acting packaging sequence of the wild type 
aoenovirus genome, ana a gene wnicn encooes an rxiv INei protein (w>liQ JU INO: 13) 
inserted in the parallel orientation of El. 


22 


67-70, 72. 


The claims are directed to an adenoviral vector that is at least partially deleted uf 
AEl. the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene ^M^uch encodes an HIV Nef protein (SEQ ID NO: 15) 
inserted in (he parallel orientation of £1. 


23 


71 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl . 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 9) inserted in 

the antiparallel orientation of El. 


24 


71 


The claim is ejected to an adenoviral vector that is at least partially deleted of AEl , 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 1 1) inserted in 
the antiparallel orientation of E 1 . 


25 


71 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl , 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
genuine, iulu a gciic wtuv.li ciiLU(.ica oil niv rNci proiem (obi^ lu inui i inserieo m 
the antiparallel orientation of El. 


26 


71 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl . 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
genome, and a gene which encodes an HIV Nef nrotein fSFO ID NO* inu>rri»H in 
the antiparallel orientation of E 1 . 


27 


74 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 . the vector contains the cis-uciin)' packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 9) 
inserted in £1. 


28 


74 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 .the vector contains the cis-actine packaging seouence of the wild tvpe 
adenovirus genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: H) 
inserted in El . 


29 


74 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 . the vector contains the cis-acting packaging sequence of the wild type 
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and AE3 , the vector comains thi? r.i.s-fli;iii3g pMckHging sequence t)f die wild type 
nrlennvirus genome, and a sene vAdch encodes an HIV Pol protein (SEO ID NO: 1) 
inserted in El. 


14 


55 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vector contains the cis-acting packaging sequence of the wild type 
adenovinis genome, and a gene which encodes an HIV Pol protein (SEQ ID NO: 5) 
inserted in El. 


15 


55 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 . the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Pol protein {SEQ ID NO: 7) 
inserted in E 1 . 


16 


57-61 


The claims are directed to a method of making and harvesting of a recombinant 
adenoviral particle that contains a gene encoding an HIV Pol protein. 


17 


62. 65. 66 


The claim is directed to a method of generating a cellular mediated immune response 
to HIV Pol protein with the recombinant. adenoviral particle. 


18 


63. 64 


The claim is directed to a method of generating a cellular mediated immune response 
tn HIV Pol protein with the recombinant adenoviral particle in addition to 
administering a DNA plasmid vaccine. 


19 


67-70, 72, 
73.75 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl . the vector contains the cis-acting packaging sequence of the wild type 
adenovinis senome. and a gene which encodes an HIV Nef protein (SEQ ID NO: 9) 
inserted in the parallel orientation of E 1. 


20 


67-70. 72, 
73, 75 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl . the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene v/hich encodes an HIV Nef protein fSEQ ID NO: 11) 
inserted in the parallel orienution of El . 


21 


67-70. 72, 
73,75 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl , the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 13) 
inserted in the parallel orientation of E I . 


22 


67-70. 72, 
73, 75 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl. the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 15) 
inserted in the parallel orientation of E 1 . 




71 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl. 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 9) inserted in 
the antiparallel orientation of El. 


24 


71 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl. 
the vector contains the cis-acting packaging sequence of the wild type adenovinis 
genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: U) inserted in 
the antiparallel orientation of El. 


25 


71 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl, 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 

genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 13) inserted in 
the antiparallel orientation of El. 


26 


71 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl. 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
eenome. and a gene which encodes an HIV Nef protein (SEQ ID NO: 15) inserted in 
the antiparallel orientation of El. 


27 


74 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 9) 
inserted in E I . 


28 


74 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 1 1) 
inserted in E 1 . 


29 


74 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vector contains the cis-acting packaging seciuence of the wild type 
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j rt^nnmi* nnrf a wene which encodes an HIV Net proiein (SEQ ID NO: 13) 
inserted in El. ■ 


30 


74 


The claim is directed to an adenoviral vector that is at least pariially deleted oi am 
and AE3 , the vector contains Ine cis-aciing pacKaging sequence ui me wuu vyy^ 
j^deniTVir- e-nnme, and a gene which encodes an HIV Nef protein (SEO ID NO. 15) 
inserted in E 1 . 


31 


76-80 


The claims are directed to a method of making and harvesting of a recombinant 
adenoviral particle that coniains a gene encoding an HIV Nef protein. 


32 


81,84, 85 


The claims axe directed to a method of generating a cellular mediated immune 
response to HIV Nef with the recombinant adenoviral particle. 


33 


82. 83 


The claims are directed to a method of generating a cellular mediated immune 

tn uiv Npf with the recombinant adenoviral oarticle in addition to 

administeriiiK a DNA plasmid vaccine. 


34 


86a 


The claim is drawn to a multivalent vaccine wherein ^^^2^. pol and nefaiQ expressed 
from three individual vectors. 


35 


86b, 88. 89 


The claims arc drawn to a multivalent vaccine wherein gag, pol and nefaie expressed 
from one individual vectors. 


36 


86c, 88 


Tu^ Mo'tTY^b nri» Hrn\Am to ft multivalent vaccine \N^erein kok, poi and nef are expressed 
from two individual vectors, one expressing nef-pot ftision and one expressing gag. 


37 


86d, 87. 88 


from two individual vectors, one expressing gag-pol ftision and one expressing nef. 


38 


86e, 88 


The claims are drawn to a multivalent vaccine «4ierein gag, pol and nef are expressed 
from two individual vectors, one expressing nef-gag fusion and one expressing poL 


39 


86f, 88 


The claims are uravm to a muiitvaieni vaccmc wnerein gag, fjui miu ficj mc 
from a sin^e vectors as a fusion protein. 


40 


86g, 88 


TV.,, ^fnime oro /Vnuun tn n multivalent vaccine vAierein £a£ and pol are expressed 
from two individual vectors. 


41 


86h. 88, 89 


individually from one vector. 


86i.88 


1 lie Claims arc ku <iwii *xj a liiuiH'*****"*^ i^**" '"-j — — ■ i-- 
from two individual vectors. 


43 


86j.88. 89 


<>•• 1^:— ,» nra. Amwm tr\ a rmil tivalf»Tit vnrciiie whercin Dol and/ts/'are exoressed 

Toe claims are orawn uj n rnuinvrtjuiu viiti^uic wiKii^iii^v* cuiur>bj «uv vn^^ww 

from individually from one vector. 


L AA 


86k, 88 


The claims are drawn to a multivalent vaccine wherein /le/and gag are expressed 
mdividaally from one vector. 




861. 88, 89 


The claims are drawn lo a multivalent vaccuie wlierein nef and gag are expressed 
individually from one vector. 


1 


86m. 88 


The clairhs are drawn to a multivalent vaccine wherein gag and pol are expressed as a 
fusion protein from one vector. 


r47 


86n. 88 


The claims are drawn to a mulUvalent vaccine wherein pol and ne/are expressed as a 
fusion protein from one vector. 




86o. 88 


The claims are drawn to a multivcdent vaccine wherein nd/and gag are expressed as a 
fusion proiein from one vector. 



The inventions listed as Groups 1-48 do not relate to a single general inventive concept under PCT Rule 13.1 because, under PCT 
Rule 13.2. they lack the same or corresponding special technical features lor the toUowmg reasons: 

The lechnical feature linkmg groups 1-33 appears to be a recombinant adenoviral vector wherein the adenoviral vector is at least 
P^iriiallv deleted in El but the vector may contain more deletions, the vector contains wild type sequences includmg packaging signals 
'"^^ « gene encoding a heterologous HIV protein or fragments thereof. ErU et al. (WO 96/39178) disclose a recombimmi adenoviral 
V'Jcior that is deleted in El and partially deleted in E3. die remainder of the adenoviral vector contains wdd type sequences. The 
'•^^^*t'»T additionally contains an insertion of a heterologous protein which includes HIV proteins (see abstract and claims 1 and 5). 
Therefore, the technical feanirc Unking the inventions of groups 1-45 does not constitute a special lechnical feature as defmcd by PCT 
13.2, as it does not define a contribution over the prior art. 

"^^^^^ iipeciia lechnical feature of the following groups 1-3, 7-15. 19-30 and 34-48 is considered to be the combircuion of sequences that 
'^^li^^loscd m each group, see individual claim groupings above lor Uic dilTerem sequences. The DNA disclosed ui each group is 
"'•^'dc itp (iirf„.e„. seauence having a different structure and dilTercni amclion. 
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BOX n. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING 

This fipplication ccmtaixis the foUowing inventions or groups of invendons which are not so linked as to fonn a single general 
inventive concept under PCX Rule 13.1. In order for aU inventions to be examined, ±e jqppropriflte adcfitional examination fees must 
be paid. 



Group 


Claims 




1 


1-5,8-11, 
13-18, 29, 
30, 31, 32, 
34, 35, 37 


The claims are directed to an adenoviral vector that is at least partifilly deleted of 
AEl, the vector contains the cis-acting packaging sequence of the wild type 
ndenovinis genome, anc a ^ene WDicn encooes an tiiv uaKproucin uj vtyj. 
inserted in the oarallel orientation of El. In addition the vector contains a promoter 
and a pdyadenylation signal. 


2 


6, 7, 36 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl and AE3, die vector contains the cis-acting packagmg secjuence of the wQd type 
adenovirus genome, and a gene ^ch encodes an HIV Gag protein (SEQ ID NO; 
29). 


3 


12,33 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl, the vector contains the cis-acting packaging sequence of the wild Q^pe 
adenovirus genome, and a gene v/bXch encodes an HTV protein inserted in the 
antiparallel orientation of El . 


4 


19-23, 38-42 


The claims are directed to a method of making and harvesting of a recombmant 
adenoviral particle that contains a gene encoding an HIV Gag protem. 


5 


24, 27, 28, 
43, 46,47 


The claim is directed to a method of generating a cellular mediated immune response 
to HIV Gag protein with the recombinant adenoviral particle. 


6 


25,26, 44. 
45 


The claim is directed to a method of generating a cellular mediated immune response 
tn HTV fjftg protein with the recombinant adenoviral nartide in addition to 
afimini Pterins a DNA plasmid vaccine. 


7 


48-51,53, 
54.56 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl , the vector contains the cis>acting packaging sequence of the wild type 
adenovirus genome, and a gene vAnch encodes an HIV Pol protein (SEQ ID NO: 1) 
inserted in the parallel odencation of El. 


8 


48-51, 53. 
54,56 


The giMwiB are directed to an adenoviral vector diat is at least partially deleted of 
AEl , the vector contains the cis-acting packaging sequence of the wild type 
ftdenovinis gemmae, and a eene which encodes an HIV Pol protein (SEQ ID NO: 5) 
inserted in the parallel orientation of El . 


9 


48-51, 53. 
54, 56 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl, the vector gnntains the cis-acting padcaging sequence of the wild type 
adenoionis genome, and a eene vAucfa encodes an HIV Pol protein (SEO ID NO: 7) 
Inserted in the parallel orientation of El . 


10 


52 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl, 
die vector contains the cis-acting packaging sequence of the wild type adenovirus 
penome. and a Ecne vAich encodes an HIV Pol protein (SEQ ID NO: 1) inserted in 
the antiparallel orientation of El. 


U 


52 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl, 
the vector contains the cis-actuag packaging sequence of the wild type adenovirus 
genome, and a gene v^ch encodes an HIV Pol protein (SEO ID NO: 5) mserted in 
the antiparallel orientation of El. 


12 


52 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl, 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
pencMne. and a eene which encodes an HIV Pol protein (SEO ID NO: 7) mserted in 
the antiparallel orientadon of El. 


13 


55 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
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and AE3 , the vector contains the cis-acting packagii^g sequence of the wild type 
adenovirus genome, and a gene viWch encodes an HIV Pol protein (SEO ID NO- n 
msertedinEl. 


14 


55 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vector contams the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Pol protem (SEO ID NO- 5^ 
msertedinEl. ■ — £ 


15 


55 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 . Oifi vector contains the cis-acting packaging sequence of the wild lyp^ 
adenovirus genome, and a gene vhich encodes an HIV Pol protein rSEO ID NO- 7^ 
inserted in EI. ^ 


16 


57-61 


The claims are directed to a method of nwlring and harvesting of a recombinant 
adenoviral particle that contains a Rene encoding an HIV Pol protein. 


17 


62, 65.66 


The clann is directed to a method of generating a cellular mediated immine response 
to HIV Pol protein with the recombbmt adenoviral nartide 


iS 


63,64 


The clann is directed to a method of generating a celhilar mediated hnmune response 
to HTV Pol protein with the recombinant adenoviral particle in addition to 
admimsteripfi a DNA plasmid vaccine. 


19 


67-70. 72, 
73,75 


The claims are (firected to an adenoviral vector that l« at iMct nBrtiaiiir <4Aia*a<i 
AEl . the vector contains the cis-acting padcagmg sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef protein (SEO H) NO* 9) 
mserted in the parallel orientation of El. 


20 


67-70. 72, 
73»75 


The claims are directed to an adeoovira] vector that is at leiL<:t nnrriniiv HdW^n 
AEl , tiie vector conlams the cis-acting padcaging sequence of die wild type 
adenovirus genome, and a gene iK^ch encodes an HIV Nef protein (SEO ED NO- 1 n 
mserted m the parallel orientation of El. 


21 


67-70, 72, 
73,75 


The claims are directed to an adenoviral vector that is at \&rs.\ nnrtintiv AtA^tt^A 
AEl , the vector contahas the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef protem ^SEO ID NO: 13^ 
mserted m the parallel orientation of E 1 . 


u. 


67-70, 72. 
73,75 


The claims are directed to an adenoviral vector thnt i<: At Imr^ narHniiu AtAa\^A 
AEl , the vector contains the cis-acting packagmg sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef protein fSEQ ID NO: I S) 
mserted m the parallel orientation of E 1 . 


23 


71 


I he claim is directed to an adenoviral vertnr thnr ic at looct nar^Diiir a.a^^^a a v-i 
,444 uuwuwTuoi v^i.u>i uiui i2> Hi icosi pamaiiy ueieteo oi AEl 

the vector contains the cis-acting packaging sequence of the wad type adenoviras""' 

genome, and a gene vtoch encodes an HTV Nef protem (SEQ ID NO: 9^ inserted in 

die antiparallel orientation of E 1 . 


24 


71 


Ihe claim is directed to an adenoviral vpririr tViat ic nt- i«>acr nci*>»^ait<r a^a^^a at?« 
the vector contains the cis-acting packaging sequence of die wUd ^e adenovirus^' 
genome, and a gene which encodes an HIV Nef protem rSEQ ID NO: 1 n inserted in 
the antiparallel orientation of E 1 . 


25 


71 


ixie clHim IS directed to an adennvirfil vprfm* that <o of lnnr># MMf^Mii.. j— t— «^ j ^ ^ 

„ <Mu«>wMru. \»j «ji auK^vjviioi vc(^uur uiai is at lease partuuiy deleted of AEl 

die vector contains die cis-acting packaging sequence of the wild type adenoviruT^' 
genome, and a gene vshich encodes an HIV Nef nrotem (SEO ID NO: 13^ inserted in 
die antiparallel orientation of E 1 . 


26 


71 


TjlS claim is directed tn fm nHprmvirat tkn* £n lA..>a» ->__.j.fi « ^sw. ' 

» ufc. \,Muui u uucuiEu lu wi aocBovirai vector t&at is at least partially deleted of AEl 
the vector contains die cis-acting packagmg sequence of the wUd type adenoviitS^' 
genome, and a gene viiich encodes an HIV Nef protein (SEQ ID NO: 15^ inserted in 
die antiparallel orientation of El. 


27 


74 


The clann is directed to an adenoviral vector that is at least partially deleted of AE 1 
and AE3 , die vector contams die cis-acting packaging sequence of the wild 
adenovirus genome, and a gene which encodes an HIV Nef protein «EO ID NO- ^)^ 
inserted in El . 


28 


74 


The clann is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , die vector contmns the ds-acting packaging sequence of the wUd type 
adenovirus genome, and a gene viiich encodes an HIV Nef proteui fSEO ID NO* 1 n 
inserted in El. * — 


29 


74 


The claim is directed to an adenoviral vector dial is at least partially deleted of AEl 
and AE3 . die vector contains the cis-acting packagmg sequence of the wild type"^ 
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adenovirus genome, and a gene uAuch encodes an HIV Nef protein (SEQ ID NO: 13) 
inserted in El. 


30 


74 


The daim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vector coniains die cis-actiDfE packaging sequence of the wild type 
adenovinis genome, and a gene wdiich encodes an HIV Nef protein (SEQ ID NO: 15) 
inserted In El. 


31 


76-80 


The daims are directed to a method of making and harvesting of a recomibmant 
ademviral pertide that contains a gene encoding an HIV Nef protein. 


32 


81,84,85 


The claims are directed to a method of generating a cellular mediated immune 
re^onse to HIV Nef with the recombinant adenoviral particle. 


33 


82, 83 


The claims are (Reeled to a method of generating a cellular mediated tirmmnft 
response to HIV Nef widi the recombinant adenoviral partide in addition to 
administering a DNA plasmid vaccine. 


34 


86a 


The claim is drawn to a multivalent vaccine wherein gag, pol and n^aic expressed 
from three iadividual vectors. 


3S 


86b, 88, 89 


The claims are drawn to a multivalent vaccine v^ierein gag, pol <"ir! nef toe exptcsssd 
from one individual vectors. 


36 


86c. 88 


The daims are drawn to a multivalent vaccine ^i^dierein gag, pol atwt neffoe euqiressed 
from two individual vectors, one e}q>ressing nef -pol fusion and one egqnessing gag. 


37 


86d, 87. 88 


The claims are drawn to a multivalent vaccine wtoehi gag, pol and nef art e3q>res$ed 
from two individual vectors, one expressing gag-pol fusion and one expressing nef. 


38 


86e,88 


The claims are drawn to a multivalent vaccine vteein gag, pol and nef are ejqvressed 
from two individual vectors, otie expresdi^ nef-^g fusicni and otie expressing poL 


39 


86f,88 


The daims are drawn to a multivalent vaccine wheidn gag, pol and n£f are e3q)res8ed 
from a single vectors as a i^ion protein. 


40 


86g,88 


The glnwnfl are dra'woi to a multivalent vaccine wherein gag and pol are e?q>ressed 
from two individnal vectors. 


41 


86h,88.89 


The daiiEis are drawn to a multivalent vaccine vdierein gag and pol are expressed 
individually from one vector. 


42 


86i, 88 


The daims are drawn to a multivalent vaccine wherein pol and nef sue expressed 
from two individual vectors. 


43' 


86j, 88, 89 


The claims are drawn Co a multivalent vaccine wl^rein pol and nefaie eiq^ressed 
from individually from one vector. 


44 


86k, 88 


The daims are drawn to a multivalent vaccine wherein nef and gag are expressed 
individually from one vector. 


45 


861, 88, 89 


The daims are drawn to a multivalent vaccine wherein and gag are expressed 
individually from one vector. 


46 


86m, 88 


The claims are drawn to a multivalent vaccine whereia gag and pol are expressed as a 
f\isiQn protein from one vector. 


47 


86ii« 88 


The claims are drawn to a multivalent vaccine wherein pol and nef ate expzesscd as a 
fusion protein from one vector. 


48 


860,88 


The daims are draim to a multivalent vaccine wherein nefrnd gag are expressed as a 
fusion protein from one vector. 



The Inventions listed as Groins 1-48 do not rdate to a rangle general inventive concept under PCT Rule 13. 1 because, under PCT 
Rule 13.2. they lack the same or oarrespondiog spedal techntcd features for the followii^ reasons: 

The tedmical feature linkiotg groups 1-33 iq;»pears to be a recombinant adenoviral vector Mvlierein the adenoviral vector is at least - 
partially deleted in El but the vector may contain more deletions, the vector contains wild type sequences including packaging signals 
and a gene encoding a heterologous HIV protein or fragments tiiereof. Erfl ct al. (WO 96/39178) disdose a recombinant adenoviral 
vector ttiat is ddeted in El and partially deleted in E3, the remainder of the ademniral vector couiains wild ^e sequences. The 
vector additionally contains an msertion of a hetercslogous protein vAAck includes HIV proteins (see abstract and claims 1 and 5). 
Therefore, the technical feature linkiqg the inventions of groups 1-45 does not constimte a special technical feature as defined by PCT 
Rule 13.2, as it does not deflne a cantribution over the prior art. 

The special technical feature of tiie following groups 1-3, 7-15, 19-30 and 3448 is considered to be the combination of sequeiKes that 
is disdosed in each groiq), see individual daim groupings above for flie different sequences. The DNA disclosed in each group is 
made tip of a different sequence having a different structure and different function. 
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The special technical feature of grot^ 4« 16 and 31 is considered to be a method of produciii^ recombinant adenoviral particles. Each 
group contains different sequences hence the resulting particles would have different structures and fimctioais assodat^ vnth the 
particle. 

The special technical feature of group 5, 17 and 32 is considered to be a method of producing a ceUular mediated inmninei response to 
the heterologous protein encoded by the different adeno^al vectors. Each group contains (fifferent sequences a encoffiog different 
proton, therefore the resulting itw^ iTie response will also be difterenL 

The special techmca] feature of group 6, 18 and 33 is considered to be a method of produdotg a cellular mediated imnminft reqKsnse to 
the heterologous protein encoded by the ^fei«nt adenoviral vectors in ccnyuncdon widi immxnmn^ individual a DMA plasmld 
vaccine. Each mediod coatains different sequences encoding a different proteiiu therefore the residtii^ immone response will also be 
different 

Accorduigly , groins 1-48 are not so linked by the same or correqpondiog technical feature as to form a single general invetitive 
concept. 
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^ (57) Abstract: First generation adenoviral vectors and associated recombinant adenovirus-based HIV vaccines which show en- 
\2 hanced stability and growth properties and greater cellular- mediated inununity are described within this specification. These ade- 
^ noviral vectors are utilized to generate and produce through cell culture various adenoviral-based mV-l vaccines which contain 
a irrV-l gag. HIV-1 pol and/or HIV-l nef polynucleotide pharmaceutical products, and biologically relevant modifications thereof. 
00 'I'hesc adenovirus vaccines, when directly introduced into living vertebrate tissue, preferably a mammalian host such as a human or a 
O non-human mammal of commercial or domestic veterinary importance, express the HIVl- Gag, Pol and/or Nef protein or biologically 
^ modification thereof, inducing a cellular immune response which specifically recognizes HIV-1. The exemplified polynucleotides 
2 of the present invention are synthetic DNA molecules encoding HlV-1 Gag, encoding codon optimized HIV-1 Pol, derivatives of 
^ optimized IWV-l Pol (including constructs wherein protease, reverse transcriptase, RNAse H and integrase activity of HIV-1 Pol 
^ is inactivated), IQV-l Nef and derivatives of optimized HlV-l Nef, including nef mutants which effect wild type characteristics of 
^ Nef, such as myrislylation and down regulation of host CD4. The adenoviral vaccines of the present invention, when administered 
O alone or in a combined modality regime, will offer a prophylactic advantage to previously uninfected individuals and/or provide 
^ a therapeutic effect by reducing viral load levels within an infected individual, thus prolonging the asymptomatic phase of HTV-l 
^ infection. 
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TITLE OF TOE INVENTION 

ENHANCED FIRST OEN'ERATION ADENOVIRUS VACCINES EXPRESSING 
CODON OPTIMIZED HIVl-GAG, POL, NEF AND MODIFICATIONS 

5 CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit, under 35 U.S.C. §1 19(e), of U.S. 
provisional applications 60/233,180, 60/279,056, and Attorney Docket 20867PV2 
(serial number unassigned), filed September 15, 2000, March 27, 2001, and 
September 7, 2001, respectively. 

10 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not Applicable 

REFERENCE TO MICROFICHE APPENDIX 
15 Not Applicable 

^TELD OF IHE INVENTION 

The present invention relates to recombinant, replication-deficient first 
generation adenovirus vaccines found to exhibit enhanced growth properties and 

20 greater cellular-mediated immunity as compared to other replication-deficient vectors. 
The invention also relates to the associated first generation adenoviral vectors 
described herein, which, through the incorporation of additional 5' adenovirus 
sequence, enhance large scale production efficiency of the recombinant, replication- 
defective adenovirus described herein. Another aspect of the instant-invention is ttie 

25 surprising discovery that the intron A portion of the human cytomegalovirus (hCMV) 
promoter constitutes a region of instability in adenoviral vector constructs. Removal 
of this region firom adenoviral^expression constructs results in greatly improved vector 
stability. Therefore, improved vectors expressing a transgene under the control of an 
intron A-deleted CMV promoter constitute a further aspect of this invention. These 

30 adenoviral vectors are useful for generating recombinant adenovirus vaccines against 
human immunodeficiency virus (HTV). In particular, the first generation adenovirus 
vectors disclosed herein are utilized to construct and generate adenovirus-based HTV- 
1 vaccines which contain HIV-1 Gag, HtV-l Pol and/or HIV-1 Nef polynucleotide 
pharmaceutical products, and biologically active modifications thereof. Host 

35 administration of the recombinant, replication-deficient adenovirus vaccines described 
herein results in expression of HIV-1 Gag, HTV-l- Pol and/or Nef protein or 
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immunologically relevant modifications thereof, inducing a cellular immune response 
which specifically recognizes HEV-l. The exemplified polynucleotides of the present 
invention are synthetic DNA molecules encoding codon optimized HIV-1 Gag, HTV^l 
Pol, derivatives of optimized HTV-l Pol (including constructs wherein protease, 
5 reverse transcriptase, RNAse H and integrase activity of HIV-1 Pol is inactivated). 
HTV-l Nef, and derivatives of optimized HIV-1 Nef, including nef mutants which 
effect wild type characteristics of Nef, such as myristylation and dovm regulation of 
host CD4. The HIV adenovirus vaccines of the present invention, when administered 
alone or in a combined modality and/or prime/boost regimen, v^dll offer a prophylactic 
10 advantage to previously uninfected individuals and/or provide a therapeutic effect by 
reducing viral load levels within an infected individual, thus prolonging the 
asymptomatic phase of HIV-1 infection. 

BACKGROUND OF THE INVENTION 

15 Human Immunodeficiency Virus-1 (HTV-l) is the etiological agent of 

acquired human immune deficiency syndrome (AIDS) and related disorders, HTV-l 
is an RNA virus of the Retroviridae family and exhibits the S'LTR-gag-pol-env- 
LTR 3' organization of all retroviruses. The integrated form of HIV-1, known as the 
provinis, is approximately 9.8 Kb in length. Each end of the viral genome contains 

20 flanking sequences known as long terminal repeats CLTRs), The HIV genes encode at 
least nine proteins and are divided into three classes; the major structure proteins 
(Gag, Pol, and Env), the regulatory proteins (Tat and Rev); and the accessory proteins 
(Vpu, Vpr, Vif and Nef). 

The gag gene encodes a 55-kilodalton (kDa) precursor protein (p55) which is 

25 expressed firom the unspliced viral mRNA and is proteolytically processed by the HTV 
protease, a product of the po/ gene. The mature pSS protein products are pl7 
(matrix), p24 (capsid), p9 (nucleocapsid) and p6. 

The pol gene encodes proteins necessary for virus replication; a reverse 
transcriptase, a protease, integrase and RNAse H. These viral proteins are expressed 

30 as a Gag-Pol fusion protein, a 160 kDa precursor protein which is generated via a 
ribosomal frame shifting. The viral encoded protease proteolytically cleaves the Pol 
polypeptide away from the Gag-Pol fusion and further cleaves the Pol polypeptide to 
the mature proteins which provide protease (Pro. PIO), reverse transcriptase (RT, 
P50), integrase (IN, p31) and RNAse H (RNAse, pl5) activities. 
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The nfi/gene encodes an early accessory HIV protein CNef) which has been 
shown to possess several activities such as down regulating CD4 expression, 
disturbing T-cell activation and stimulating HIV infectivity. 

The env gene encodes the viral envelope glycoprotein that is translated as a 
5 160-kilodaIton (KDa) precursor (gpl60) and then cleaved by a cellular protease to 
yield the external 120-kDa envelope glycoprotein (gpl20) and the transmembrane 41- 
kDa envelope glycoprotein (gp41). Gpl20 and gp41 remain associated and are 
displayed on the viral particles and the surface of HIV-infected cells. 

The tat gene encodes a long form and a short form of the Tat protein, a RNA 
10 binding protein which is a transcriptional transactivator essential for HIV-1 
replication. 

The rev gene encodes the 13 kDa Rev protein, a RNA biiiding protein. The 
Rev protein binds to a region of the viral RNA termed the Rev response element 
(RRE). The Rev protein promotes transfer of unspliced viral RNA from the nucleus 

15 to the cytoplasm. The Rev protein is required for HIV late gene expression and in 
turn, HTV replication. 

Gpl20 binds to the CD4/chemokine receptor present on the surface of helper 
T-lymphocytes, macrophages and other target cells in addition to other co-receptor 
molecules. X4 (macrophage tropic) virus show tropism for CD4/CXCR4 complexes 

20 while a R5 (T^ell line tropic) virus interacts with a CD4/CCR5 receptor complex. 
After gpl20 binds to CD4, gp41 mediates the fusion event responsible for virus entry. 
The virus fuses with and enters the target cell, followed by reverse transcription of its 
single stranded RNA genome into the double-stranded DNA via a RNA dependent 
DNA polymerase. The viral DNA, known as provirus, enters the cell nucleus, where 

25 the viral DNA directs the production of new viral RNA within the nucleus, expression 
of early and late HIV viral proteins, and subsequently the production and cellular 
release of new virus particles. Recent advances in the ability to detect viral load 
within the host shows that the primary infection results in an extremely high 
generation and tissue distribution of the vmis, followed by a steady state level of virus 

30 (albeit through a continual viral production and turnover during this phase), leading 
ultimately to another burst of vmis load which leads to the onset of cbnical AIDS. 
Productively infected cells have a half life of several days, whereas chronically or 
latently infected cells have a 3-week half life, followed by non-productively infected 
cells which have a long half life (over 100 days) but do not significantly contribute to 

35 day to day viral loads seen throughout the course of disease. 
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Destruction of CD4 helper T lymphocytes, which are critical to immune 
defense, is a major cause of the progressive immune dysfunction that is the hallmark 
of HIV infection. The loss of CD4 T-<:eUs seriously impairs the body's ability to fight 
most invaders, but it has a particularly severe impact on the defenses against viruses, 
5 fungi, parasites and certain bacteria, including mycobacteria. 

Effective treatment regimens for HIV-1 infected individuals have become 
available recently. However, these drugs will not have a significant impact on the 
disease in many parts of the worid and they will have a minimal impact in halting the 
spread of infection within the human population. As is true of many other infectious 

10 diseases, a significant epidemiologic impact on the spread of HIV-1 infection will 
only occur subsequent to the development and introduction of an effective vaccine. 
There are a number of factors that have contributed to the lack of successful vaccine 
development to date. As noted above, it is now apparent that in a chronically infected 
person there exists constant virus production in spite of the presence of anti-HTV-l 

15 humoral and cellular immune responses and destruction of virally infected cells. As 
in the case of other infectious diseases, the outcome of disease is the resuh of a 
balance between the kinetics and the magnitude of the immune response and the 
:^athogen replicative rale and accessibility to the inmiune response. Pre-existing 
immunity may be more successful with an acute infection than an evolving immune 

20 response can be with an established infection. A second factor is the considerable 
genetic variability of the virus. Although anti-HTV^l antibodies exist that can 
neutralize HIV-1 infectivity in cell culture, these antibodies are generaUy virus 
isolate-specific in their activity. It has proven impossible to define serological 
groupings of HTV-l using traditional methods. Rather, the virus seems to define a 

25 serological "continuum'* so that individual neutralizing antibody responses, at best, 
are effective against only a handful of viral variants. Given this latter observation, it 
would be useful to identify immunogens and related delivery technologies that are 
likely to elicit anti-HIV-1 cellular immune responses. It is known that in order to 
generate CTL responses antigen must be synthesized within or introduced into cells, 

30 subsequently processed into small peptides by the proteasome complex, and 

translocated into the endoplasmic reticulum/Qolgi complex secretory pathway for 
eventual association with major histocompatibility complex (MHC) class I proteins. 
CDS^ T lymphocytes recognize antigen in association with class I MHC via the T cell 
receptor (TCR) and the CDS cell surface protein. Activation of naive CDS* T cells 

35 into activated effector or memory cells generaUy requires both TCR engagement of 
antigen as described above as well as engagement of costimulatory proteins. Optimal 
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induction of CTL responses usually requires "help" in the fonn of cytokines from 
CD4* T lymphocytes which lecogoize antigen associated with MHC class n 
molecules via TCR and CD4 engagement. 

European Patent Applications 0 638 316 (Published February 15, 1995) and 0 
5 586 076 (Published March 9, 1994), (both assigned to American Home Products 

Corporation) describe replicating adenovirus vectors carrying an HIV gene, including 
env or gag. Various treatment regimens were used with chimpanzees and dogs, some 
of which included booster adenovirus or protein plus alum treatments. 

Replication-defectivjs adenoviral vectors harboring deletions in the El region 
10 are known, and recent adenoviral vectors have incorporated the known packaging 
repeats into these vectors; e.g., see EP 0 707 071, disclosing, inter alia, an adenoviral 
vector deleted of El sequences from base pairs 459 to 3328; and U.S. Patent No. 
6,033,908, disclosing, inter alia^ an adenoviral vector deleted of base pairs 459-3510. 
The packaging efficiency of adenovirus has been taught to depend on the number of 
15 incorporated individual A (packaging) repeats; see, e.g., Grable and Hearing, 1990 J. 
Virol. 64(5):2047-2056; Giable and Hearing, 19927. Virol 66(2):723-731. 

Larder, et al.. (1987, Nature 327: 716-717) and Larder, et al.. (1989. Proc. 
Natl Acad. 5d. 86: 4803-4807) disclose site specific mutagenesis of HIV-1 RT and 
the effect such changes have on in vitro activity and infectivity related to interaction 
20 with known inhibitors of RT. 

Davies, et al. (1991, Science 252:, 88-95) disclose the crystal structure of the 
RNase H domain of HTV-l Pol 

Schatz, et al. (1989, FEBS Lett. 257: 31 1-314) disclose that mutations 
Glu478Gln and ffis539Phe in a complete HIV-1 RT/RNase H DNA fragment results 
25 in defective RNase activity without effecting RT activity, 

Mizrahi, et al. (1990. Nucl Acids. Res. 18: pp. 5359-5353) disclose additional 
mutations Asp443 Asn and Asp498Asn in the RNase region of the pol gene which also 
results in defective RNase activity. The autiiors note that the Asp498A5n mutant was 
difficult to characterize due to instability of this mutant protein. 
30 Leavitt, et al. (1993. /. Biol Oxem. 268: 21 13-21 19) disclose several 

mutations, including a Asp64Val mutation, which show .differing effect on HlV-1 
integrase (EN) activity. 

Wiskerchen, et al. (1995, /. Virol 69: 376-386) disclose singe and double 
mutants, including mutation of aspartic acid residues which effect HIV-1 IN and viral 
35 replication functions. 
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It would be of great import in the battle against AIDS to produce a 
prophylactic- and/or therapeutic-based HTV vaccine which generates a strong cellular 
immune response against an HTV infection. The present invention addresses and 
meets these needs by disclosing a class of adenovirus vaccines which, upon host 

5 administration, express codon optimized and modified versions of the HTV-l genes, 
gag, pol and iief. These recombinant, replication-defective adenovirus vaccines may 
be administeied to a host, such as a human, alone or as part of a combined modality 
regimen and/or prime-boost vaccination regimen with components of the present 
invention and/or a distinct viral HIV DNA vaccine, non-viral HTV DNA vaccine, HIV 

10 subunit vaccine, an HIV whole killed vaccine and/or a live attenuated HIV vaccine. 

SUMMARY OF THE INVENTION 

Tlie present invention relates to enhanced replication-defective recombinant 
adenovirus vaccine vectors and associated recombinant, replication-deficient 

15 adenovirus vaccines which encode various forms of HIV-1 Gag, HIV-1 Pol, and/or 
HTV-l Nef, including immunologically relevant modifications of HIV-1 Gag, HTV-l 
Pol and HTV-l Nef. The adenovirus vaccines of the present invention express HTV 
antigens and provide for improved cellular-mediated immune responses upon host 
administration. Potential vaccinees include but are not limited to primates and 

20 especially humans and non-human primates, and also include any non-human 
manmial of conomercial or domestic veterinary importance. An effect of the 
improved recombinant adenovims-based vaccines of the present invention should be a 
lower transmission rate to previously uninfected individuals (i.e., prophylactic 
applications) and/or reduction in the levels of the viral loads within an infected 

25 individual (i.e., therapeutic applications), so as to prolong the asymptomatic phase of 
HIV-1 infection. In particular, the present invention relates to adenoviral-based 
vaccines which encode various forms of codon optimized HTV-l Gag (including but 
in no way limited to p55 versions of codon optimized full length (PL) Gag and tPA- 
Gag fusion proteins), HIV-1 Pol, HIV-1 Nef, and selected modifications of 

30 immunological relevance. The administration, intracellular delivery and expression 
of these adenovirus vaccines elicit a host CTL and Th response. The preferred 
replication-defective recombinant adenoviral vaccine vectors include but are not 
limited to synthetic DNA molecules which (1) encode codon optimized versions of 
wild type HFV-l Gag; (2) encode codon optimized versions of HIV-1 Pol; (3) encode 

35 codon optimized versions of HIV-1 Pol fusion proteJins; (4) encode codon optimized 
versions of modified HFV-l Pol proteins and fusion proteins, including but not limited 
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to pol modifications involving residues within the catalytic regions responsible for 
RT, RNase and IN activity within the host cell; (5) encode codon optimized versions 
of wild type HTV-l Nef; (6) codon optimized versions of HIV-1 Nef fusion piotdns; 
and/or (7) codon optimized versions of HTV-l Nef derivatives, including but not 
5 limited to ne/modifications involving introductioii of an amino-tenninal leader 
sequence, removal of an amino-terminal myristylation site and/or introduction of 
dileucine motif mutations. The Nef-based fusion and modified proteins, disclosed 
within this specification and expressed from an adenoviral-based vector vaccine this 
specificaUon, may possess altered trafficking and/or host cell function while retaining 

10 die ability to be properly presented to the host MHC I complex and in turn ehcit a 
host CTL and Th response. Examples of HIV-1 Gag, Pol and/or Nef fusion proteins 
include but are not limited to fusion of a leader or signal peptide at the NH2-teriniinal 
portion of the viral antigen coding region. Such a leader peptide includes but is not 
limited to a tPA leader pepti(te. 

15 The adenoviral vector utilized in construction of the HTV-l Gag-, HIV-1 Pol- 

and/or HFV-l Nef- based vaccines of the present invention may comprise any 
•^-nlication-defective adenoviral vector which provides for enhanced genetic stability 
of the recombinant adenoviral genome through large scale production and purification 
of the recombinant vims. In other words, an HTV-l Gag-, Pol- or Nef-based 

20 adenovims vaccine of the present invention is a purified recombinant, repUcation- 
defective adenovirus which is shown to be genetically stable through multiple 
passages in cell culture and remains so during large scale production and purification 
procedures. Such a recombinant adenovirus vector and harvested adenovims vaccine 
lends itself to large scale dose filling and subsequent worldwide distribution 

25 procedures which wDl be demanded of an efficacious monovalent or multivalent HTV 
vaccine. The present invention meets this basic requirement with description of a 
replication-defective adenoviral vector and vectors derived thereftom. at least 
partiaUy deleted in El, comprising a wUdtype adenovirus cis-acting packaging region 
from about base pair 1 to between from about base pair 342 (more preferably, 400) to 

30 about base pair 458 of the wildtype adenovirus genome. A preferred embodiment of 
the instant invention comprises base pairs 1-450 of a wildtype adenovirus. In other 
preferred embodiments, the replication -defective adenoviral vector has, in addition 
thereto, a region 3' to the El -deleted region comprising base pairs 3511-3523. 
Basepairs 342-450 (more particularly, 400-450) constitute an extension of the 

35 5'region of previously disclosed vectors carrying viral antigens, particularly HTV 
antigens (see, e.g., PCT Intemadonal Application PCT/USOO/1 8332, published 
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January 1 1, 2001 (WO 01/02067), which claims priority to U.S. Provisional 
Application Serial Nos. 60/142,631 and 60/148,981, filed 7/6/1999 and 8/13/1999, 
respectively; these documents herein incorporated by reference. Applicants have 
found that extending the 5' region further into the El gene into the disclosed vaccine 
5 vectors incorporated elements found to be important in optimizing the packaging of 
the virus. 

As compared to previous vectors not comprising basepairs from about 1 to 
between from about base pair 342 (more preferably, 400) to about base pair 458 of the 
wildtype adenovirus genome, vectors comprising the above region exhibited enhanced 

10 growth characteristics, with approximately 5-10 fold greater amplification rates, a 
more potent virus effect, allowing lower doses of virus to be used to generate 
equivalent inomunity; and a greater cellular-mediated inunune response than 
replication-deficient vectors not comprising this region (basepairs 1-450). Even more 
important, adenoviral constructs derived therefrom are very stable genetically in 

15 large-scale production, particularly those comprising an expression cassette under the 
control of a hCMV promoter devoid of intron A. This is because Applicants have 
surprisingly found that the intron A portion of the hCMV promoter constituted a 
region of instability when employed in adenoviral vectors. Applicants have, 
therefore, identified an enhanced adenoviral vector which is particularly suited for use 

20 in gene therapy and nucleotide-based vaccine- vectors which, favorably, lends itself to 
large scale propagation. 

A preferred embodiment of this invention is a replication-defective adenoviral 
vector in accordance with the above description wherein the gene is inserted in the 
form of a gene expression cassette comprising (a) a nucleic acid encoding a protein or 

25 biologically active and/or immunologically relevant portion thereof; (b) a 

heterologous promoter operatively linked to the nucleic acid of part a); and, (c) a 
transcription terminator. 

In preferred embodunents, the El gene, other than that contained within 
basepairs 1-450 or, alternatively, that contained within base pairs 1-450 and 3511- 

30 3523 has been deleted from the adenoviral vector, and the gene expression cassette 
. has replaced the deleted El gene. In other prefened embodiments, the replication 
defective adenovirus genome does not have a functional E3 gene, or the E3 gene has 
been deleted. Most preferably, the E3 region is present within the adenoviral genome. 
Further preferred embodiments are wherein the gene expression cassette is in an El 

35 anti-parallel (transcribed in a 3' to 5' direction relative to the vector backbone) 
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orientation or, more preferably, an El parallel (transcribed in a 5* to 3* direction 
relative to the vector backbone) orientation. 

Further embodiments relate to a shutfle plasmid vector comprising: an 
adenoviral portion and a plasmid portion, wherein said adenovirus portion comprises: 
5 a) a replication defective adenovirus genome, at least partially deleted in El, 

comprising a wildtype adenovirus c£y-acting packaging region from about base pair 1 
to between from about base pair 342 (more preferably, 400) to about base pair 458 
(preferably, 1-450) of the wildtype adenovirus genome and, preferably, in addition 
thereto, basepairs 3511-3523 of a wildtype adenovirus sequence; and b) a gene 

10 expression cassette comprising: (a) a nucleic acid encoding a protein or biologically 
active and/or immunologically relevant portion thereof; (b) a heterologous promoter 
operatively linked to the nucleic acid of part a);and (c) a transcription t^minator 
and/or a polyadenylation site. 

Other aspects of this invention include a host cell comprising said adenoviral 

15 vectors and/or said shuttle plasmid vectors; vaccine compositions comprising said 
vectors; and methods of producing the vectors comprising (a) introducing the 
adenoviral vector into a host cell which expresses adenoviral El protein, and (b) 
harvesting the resultant adenoviral vectors. 

To this end, the present invention particularly relates to harvested 

20 recombinant, replication defective virus derived from a host cell, such as but not 
limited to 293 cells or PER.C6® cells, including but not limited to harvested vims 
related to any of the MRELA.d5 vector backbones, with or without an accompanying 
transgene, including but not limited to the HTV-l antigens described herein. An 
mV-l vaccine is represented by any harvested, recombinant adenovirus material 

25 which expresses any one or more of the KV-l antigens disclosed herein. This 
harvested material may then be purified, fomiulated and stored prior to host 
administration. 

Another aspect of this invention is a method of generating a cellular immune 
response against a protein in an individual comprising administering to the individual 
30 an adenovirus vaccine vector comprising: 

a) a recombinant, replication defective adenoviral vector, at least partially 
deleted in El, comprising a wildtype adenovirus cw-acting adenovirus packaging 
region from about base pair 1 to between from about base pair 342 (more preferably, 
400) to about base pair 458 (preferably, 1-450) and, preferably in addition tiiereto, 
35 base pairs 351 1-3523 of a wildtype adenovirus sequence, and. 
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b) a gene expression cassette coxnprising:(i) a nucleic acid encoding a protein 
or biologically active and/or immunologically relevant portion thereof; (ii) a 
heterologous promoter operatively linked to the nucleic acid of part a); and (iii) a 
transcription terminator and/or a polyadenylation site. 

In view of the efiBcacious nature of the adenoviral and/or DNA plasmid 
vaccines described herein, the present invention relates to all methodology regarding 
administration of one or more of these adenoviral and/or DNA plasmid vaccines to 
provide effective immunoprophylaxis, to prevent establishment of an HTV-l infection 
following exposure to this virus, or as a post-HTV infection therapeutic vaccine to 
mitigate the acute HTV-l infection so as to result in the establishment of a lower virus 
load with beneficial long term consequences. As discussed herein, such a treatment 
legimen may include a monovalent or multivalent composition, various combined 
modality applications, and/or a primeA>oost regimen to as to optimize antigen 
expression and a concomitant cellular-mediated and/or humoral inunune response 
upon inoculation into a living vertebrate tissue. Therefore, the present invention 
provides for methods of using the adenoviral and/or DNA plasmid vaccines disclosed 
herein within the various parameters disclosed herein as well as any additional 
parameters known in the art, which, upon introduction into manamalian tissue induces 
intracellular expression of the gag, pol and/or nef-based vaccines. 

To this end, the present invention relates in part to methods of generating a 
cellular immune response in a vaccinee, preferably a human vaccinee, wherein the 
individual is given more than one administration of adenovirus vaccine vector, and it 
may be given in a regimen accompanied by the administration of a plasmid vaccine. 
The plasmid vaccine (also referred to herein as a "DNA plasmid vaccine" or "vaccine 
plasmid" comprises a nucleic add encoding a protein or an immunologically relevant 
portion thereof, a heterologous promoter operably linked to the nucleic acid sequence, 
and a transcription terminator or a polyadenylation signal (such as bGH or SPA, 
respectively). There may be a predetermined minimum amount of time separating the 
administrations. The individual can be given a first dose of plasmid vaccine, and then 
a second dose of plasmid vaccine. Alternatively, the individual may be given a first 
dose of adenovirus vaccine, and then a second dose of adenovirus vaccine. In other 
embodiments, the plasmid vaccine is administered first, followed after a time by 
administration of the adenovirus vaccine. Conversely, the adenovirus vaccine may be 
administered first, followed by administration of plasmid vaccine after a time. In 
these embodiments, an individual may be given multiple doses of the same 
adenovirus serotype in either viral vector or plasmid form, or the virus may be of 
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differing serotypes. Jn the alternative, a viral antigen of interest can be fibrst delivered 
via a.viral vaccine other than an adenovirus-based vaccine, and then followed with the 
adenoviral vaccine disclosed. Alternative viral vaccines include but are not limited to 
pox virus and Venezuelan equine encephilitis virus. 
5 The present invention also relates to multivalent adenovirus vaccine 

compositions which comprise Gag» Pol and Nef components described herein; see, 
e.g., Example 29 and Table 25. Such compositions will provide for an enhanced 
cellular immune response subsequent to host administration, particularly given the 
genetic diversity of human MHCs and of circulating virus. Examples, but not 

10 limitations, include MRKAdS-vector based multivalent vaccine compositions which 
provide for a divalent (i.e., gag and nef, gag and pol, or pol and nef components) or a 
trivalent vaccine (i.e., gag, pol and nef components) composition. Such a mutlivalent 
vaccine may be filled for a single dose or may consist of multiple inoculations of each 
individually filled component; and may in addition be part of a prime/boost regimen 

15 with viral or non-viral vector vaccines as introduced in the previous paragraph. To 
this end, preferred compositions are MRKAd5 adenovirus used in combination with 
Multiple, distinct HIV antigen classes. Each H[V antigen class is subject to sequence 
manipulation, fiius providing for a multitude of potential vaccine combinations; and 
such combinations are within the scope of the present invention. The utilization of 

20 such combined modalities vaccine formulation and administration increase the 

probability of eliciting an even more potent cellular immune response when compared 
to inoculation with a single modality regimen. 

The concept of a "combined modality" as disclosed herein also covers the 
alternative mode of administration whereby multiple HTV-l viral antigens may be 

25 ligated into a proper shutfle plasmid for generation of a pre-adenoviral plasmid 
comprising multiple open reading frames. For example, a trivalent vector may 
comprise a gag-pol-nef fusion, in either a E3(-) or E3(+) background, preferably a E3 
deleted backbone, or possibly a "2+1" divalent vaccine, such as a gag-pol fusion (i.e., 
codon optinoized p5S gag and inactivated optimized pol; Example 29 and Table 25) 

30 within the same MRKAd5 backbone, with each open reading frame being operatively 
linked to a distinct promoter and transcription termination sequence. Alternatively, the 
two open reading frames may be operatively linked to a single promoter, with the 
open reading frames operatively linked by an internal ribosome entry sequence 
(IRES). Therefore, a multivalent vaccine delivered as a single, or possibly a second 

35 harvested recombinant, replication-deficient adenovirus is contemplated as part of the 
present invention. 
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Therefore, the adenoviral vaccines and plasmid DNA vaccines of this 
invention may be administered alone, or may be part of a prime and boost 
administration regimen. A mixed modality priming and booster inoculation scheme 
will result in an enhanced immune response, particularly if pre-existing anti-vector 

5 immune responses are present This one aspect of this invention is a method of 
priming a subject with the plasmid vaccine by administering the plasmid vaccine at 
least one time, allowing a predetermined length of time to pass, and then boosting by 
administering the adenoviral vaccine. Multiple primings typically, 1-4, are usually 
employed, although more may be used. The length of time between priming and 

10 boost may typically vary from about four months to a year, but other time frames may 
be used. In experiments with rhesus monkeys, the animals were primed four times 
with plasmid vaccines, then were boosted 4 months later with the adenoviral vaccine. 
Their cellular inmiune response was notably higher than that of animals which had 
only received adenoviral vaccine. The use of a priming regimen may be particularly 

IS preferred in situations where a person has a pre-existing anti-adenovirus immune 
response. 

It is an object of the present invention to provide for enhanced 
replication-defective recombinant adenoviral vaccine vector backbones. These 
recombinant adenoviral backbones may accept one or more transgenes, which may be 
20 passaged through cell culture for growth, amplification and harvest. 

It is a further object to provide for enhanced replication-defective recombinant 
adenoviral vaccine vectors which encode various transgenes. 

It is also an object of the present invention to provide for a harvested 
recombinant, replication-deficient adenovirus which shows enhanced growth and 
25 amplification rates while in combination with increased virus stability after 

continuous passage in cell culture. Such a recombinant adenovims is particularly 
suited for use in gene therapy and nucleotide-based vaccine vectors which, favorably, 
lends itself to large scale propagation. 

To this end, it is an object of the present invention to provide for (1) enhanced 
30 replication-defective recombinant adenoviral vaccine vectors as described herein 
which encode various forms of HIV-1 Gag, HIV-1 Pol, and/or HTV-l Nef, including 
immunologically relevant modifications of HIV-1 Gag, HIV-1 Pol and HTV-l Nef, 
and (2) harvested, purified recombinant replication-deficient adenovirus generated by 
passage of the adenoviral vectors of (1) through one or multiple passages through cell 
35 culture, including but not linaited to passage through 293 cells or PER.C6® ceUs. 
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It is also an object of the present invention to provide for recombinant 
adenovirus harvested by one or multiple passages through cell culture. As relating to 
recombinant adenoviral vaccine vector, this recombinant virus is harvested and 
formulated for subsequent host administration. 
5 It is also an object of the present invention to provide for replication-defective 

adenoviral vectors wherein at least one gene is inserted in the form of a gene 
expression cassette comprising (a) a nucleic acid encoding a protein or biologically 
active and/or immunologically relevant portion thereof; (b) a heterologous promoter 
operatively linked to the nucleic acid of part a); and. (c) a transcription terminator. 

10 It is also an object of the present invention to provide for a host cell 

comprising said adenoviral vectors and/or said shuttle plasmid vectors; vaccine 
compositions comprising said vectors; and methods of producing the vectors 
comprising (a) introducing the adenoviral vector into a host cell which expresses 
adenoviral El protein, and (b) harvesting the resultant adenoviral vectors. 

IS It is a further object of the present invention to provide for methods of gen^ting a 
cellular immune response against a protein in an individual comprising administering 
to the individual an adenovirus vaccine vector comprising a) a replication defective 
adenoviral vector, at least partially deleted in El, comprising a wildtype adenovirus 
cw-acting packaging region from about base pair 1 to between froin about base pair 

20 342 (more preferably, 400) to about 450 (preferably, 1-450) and, preferably, 3511- 
3523 of a wildtype adenovirus sequence, and, b) a gene expression cassette 
comprising:(i) a nucleic acid encoding a protein or biologically active and/or 
inununplogically relevant portion thereof; (ii) a heterologous promoter operatively 
linked to the nucleic acid of part a); and (iii) a transcription terminator and/or a 

25 polyadenylation site. 

It is also an object of the present invention to provide various alternatives for 
vaccine administration regimes, namely administration of one or more adenoviral 
and/or DNA plasmid vaccines described herein to provide effective 
inamunoprophylaxis for uninfected individuals or a therapeutic treatment for HTV 

30 infected patients. Such processes include but are not limited to multivalent HIV-1 
vaccine compositions, various combined modality regimes as well as. various 
prime/boost alternatives. These methods of administration, relating to vaccine 
composition and/or scheduled administration, will increase the probability of eliciting 
an even more potent cellular unmune response when compared to inoculation with a 

35 single modality regimen. 
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As used throughout the specification and claims, the following definitions and 
abbreviations aie used: 

'TEIAART" refers to - highly active antiretroviral therapy 

**first generation" vectors are characterized as being replication-defective. 
5 They typically have a deleted or inactivated El gene region, and preferably have a 
deleted or inactivated E3 gene region as well, 

"AEX" refers to Anion Exchange chromatography. 

•*QPA" refers to Quick PCR-based Potency Assay. 

"bps" refers to basepairs. 
10 "s'' or "str" denotes that the transgene is in the El parallel or "straight" 

orientation. 

*TBMCs" refers to peripheral blood monocyte cells. 
"FL" refers to fuU length. 

"FLgag" refers to a full-length optimized gag gene, as shown in Figure 2. 
15 "Ad5-Flgag" refers to an adenovirus serotype 5 replication deficient virus 

which carries an expression cassette which comprises a full length optimized gag gene 
tmder the control of a CMV promoter. 

"Promoter" means a recognition site on a DNA strand to which an RNA 
polymerase binds. The promoter forms an initiation complex with RNA polymerase 
20 to initiate and drive transcriptional activity. The complex can be modified by 
activating sequences such as enhancers or inhibiting sequences such as silencers. 

"Leader" means a DNA sequence at the 5' end of a structural gene which is 
transcribed along with the gene. This usually results a protein having an N-tenninal 
peptide extension, often referred to as a pro-sequences. 
25 "Ihtron" means a section of DNA occurring in the middle of a gene which 

does not code for an amino acid in the gene product The precursor RNA of the intron 
is excised and is therefore not transcribed into mRNA not translated into protein. 

'*taimunologically relevant" or '^biologically active" means (1) with regards to 
a viral protein, that the protein is capable, upon administration, of eliciting a 
30 measurable immune response within an individual sufficient to retard the propagation 
and/or spread of the virus and/or to reduce the viral load present within the individual; 
or (2) with regards to a nucleotide sequence, that the sequence is capable of encoding 
for a protein capable of the above, 

"Cassette" refers to a nucleic acid sequence which is to be expressed, along 
35 with its transcription and translational control sequences. By changing the cassette, a 
vector can express a different sequence. 
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"bGHpA" refers to the bovine growth hwinone transcription 
terminator/polyadenylation sequence. 

"tPAgag" refers to a fusion between the leader sequence of the tissue 
plasminogen activator leader sequence and an optimized HTV gag gene, as 
5 exemplified in Figure 30A-B, whether in a DNA or adenovirus-based vaccine vector. 

Where utilized, "lA" or "iuact" refers to an inactivated version of a gene (e.g. 

lApol). 

"MCS" is "multiple cloning site". 

In general, adenoviral constructs, gene constmcts are named by reference to 
10 the genes contained therein. For example: 

"Ad5 HIV-1 gag", also referred to as the original HIV-1 gag adenoviral 
vector, is a vector containing a trans^e cassette composed of a hCMV intron A 
promoter, the full lengdi version of die human codon-optimized HTV-l gag gene, and 
the bovine growth hormone polyadsaiylation signal. The transgene was inserted in the 
15 El antiparallel orientation in an El and E3 deleted adenovector. 

"MRK Ad5 HIV-1 gag" also referred to as "MRKAdSgag" or "Ad5gag2" is 
an adenoviral vector taught herein which is deleted of El, comprises basepairs 1-450 
and 3511-3523, and has a human codon-optimized HIV-1 gene in an El parallel 
orientation under the control of a CMV promoter without intron A. The construct 
20 also comprises a bovine growth hormone polyadenylation signal. 

"pVlJnsHIVgag", also referred to as "HIVFLgagPR9901", is a plasmid 
comprising the CMV inmnediate-early QE) promoter and intron A, a full-length 
codon-optimized HTV gag gene, a bovine growth hormone-derived polyadenylation 
and transcriptional termination sequraice, and a minimal pUC backbone. 
25 "pVlJnsCMV(no intron)-FLgag-bGHpA" is a plasmid derived ftom 

pVlJnsHIVgag which is deleted of the intron A portion of CMV and which comprises 
the fuU length HIV gag gene. This plasmid is also referred to as "pVlJnsHIVgag- 
bGHpA", pVlJns-hCMV-FL-gag-bGHpA" and "pVlJnsCMV(no intron) + FLgag + 
bGI^A". 

30 "pVl JnsCMV(no intron)-ELgag-SPA" is a plasmid of the same composition 

as pVlJnsCMV(nQ intron)-FLgag-bGHpA except that the SPA termination sequence 
replaces that of bGHpA. This plasmid is also referred to as "pVl Jns-HIVgag-SPA" 
and pVlJns-hCMV-FLgag-SPA". 

"pddElsplA" is a universal shuttle vector with no expression cassette (i.e., no 

35 promoter or polyA). The vector comprises wildtype adenovirus serotype 5 (Ad5) 
sequences from bp 1 to bp 341 and bp 3524 to bp 5798, and has a multiple cloning 
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site between the Ad5 sequences ending 341 bp and beginning 3524 bp. This plasmid 
is also referred to as the original Ad S shuttle vector. 
•MRKpdelElsplA" or "MRKpdelEl(Pac/pIX/pack450)" or 
•TvIRKpdelEl(Pac/pIX/pack450)Clar' is a universal shuttle vector with no expression 
cassette (i.e. no promoter or polyA) comprising wildtype adenovirus serotype 5 (Ad5) 
sequences from bpl to bp450 and bp 351 1 to bp 5798. The vector has a multiple 
cloning site between the Ad5 sequence ending 450 bp and beginning 3511 bp" This 
shutde vector may be used to insert the CMV promoter and the bGHpA fragments in 
both the straight ("str". or El parallel) orientation or in the opposite (opp. or El 
antiparallel) orientation) 

"MRKpdelEl(Pac/pIXypack450)+CMVmin+BGHpA(str.)" is still another 
shuttle vector which is the modified vector that contains the CMV promoter (no 
intronA) and the bGH)pA fragments. The expression unit containing the hCMV 
promoter (no intron A) and the bovine growth hormone polyadenylation signal has 
been inserted into the shutde vector such that insertion of the gene of choice at a 
unique BglH site will ensure the direction of transcription of the transgene will be Ad5 
HI parallel when inserted into the MRKpAd5(El/E3+)Clal pre-plasmid. This shuttle 
vector, as shown in Figures 22 and 23, was used to insert the respective lApol and 
G2A,LLAA nef genes directly into. 

*'MRKpdelEl-CMV(no intron>'FLgag-bGHpA" is a shuttle comprising Ad5 
sequences from basepairs 1-450 and 3511-5798, with an expression cassette 
containing human CMV without intron A, the full-length human codon-optimized 
HTV gag gene and bovine growth hormone polyadenyladon signal. This plasmid is 
also referred to as "MRKpdelEl shuttle +hCMV-FL-gag-BGHpA" 

•*MRKpAdHVE3+CMV(no intron)-FLgag-bGHpA" is an adenoviral vector 
comprising all AdS sequences except those nucleotides encompassing the El region 
(from 451-3510), a human CMV promoter without intron A, a full-length human 
codon-optunized HIV gag gene, and a bovine growth hormone polyadenylation 
signal. This vector is also referred to as "MElKpAdHVE3 + hCMV-FL-gag- 
BGHpA", "MRKpAd5HIV.lgag", ••MRKpAd5gag". "pMRKAd5gag" or 
"pAd5gag2';. 

"pVlJns-HIV-pol inact(opt)" or "pVlJns-HTV lA pel (opt) is the inactivated 
Pol gene (contained within SEQ ID N0:3) cloned into the BgUI site of VlJns (Figure 
17 A-C). As noted herein, various derivatives of HTV-l pol may be cloned into a 
plasmid expression vector such as VI Ins or VI Jns-tPA, thus serving directly as DNA 
vaccine candidates or as a source for subcloning into an appropriate adenoviral vector. 
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"MRKpdel+hCMVinin+FL-pol+bGHpA(s)" is the 
*'MRKpdemi(Pac/pIX/pack4S0)+CMVmin4*BGHpA(st^^ shuttle mentioned above 
which contaiiis the lA pol gene is the proper orientation. This shuttle vector is used in 
a bacterial recombination with MRKpAd(El-/E3+)Clal. 

5 "MRKpAd+hCMVmin+FL-pol+bGHpA(S)E3+", also referred to herein as 

"pMRKAdSpor, is the pre-adenovirus plasmid which comprises a CMV-pol 
inact(opt)-pGHpA construct. The construction of this pre*-adenovirus plasmid is 
shown in Figure 22. 

"pVlJns/nef (G2A,LLAA)" or "VI Jns/opt nef (G2A,LLAA)" comprises 

10 codon optimized HIV-1 Nef wherein the open reading frame codes for modifications 
at the amino terminal myristylation site (Gly-2 to Ala-2) and substitution of the Leu- 
174-Leu-175 dileucine motif to Ala-174-AIa~175 (SEQ ID NO:13; which comprises 
an initiating methionine residue at nucleotides 12-14 and a **TAA*' stop codon from 
nucleotides 660-662). This fragment is subcloned into the Bgl II site of VlJns 

15 and/orVl Jns-tPA (Figures 16A-B). As noted above for HIV-1 pol. HIV-1 nef 

constructs may be cloned into a plasmid expression vector such as VI Jns or VlJns- 
iPA, thus serving directly as DNA vaccine candidates or as a source for subcloning 
into an appropriate adenoviral vector. 

"MRKpdelElhCMVmiiiFL-nefBGHpA(s)", also referred to herein as 

20 "pMRKAdSnef is the pre-adenovirus plasnoid which comprises a CMV-nef 

(G2A,LLAA) codon optimized sequence. The construction of this pre-adenovirus 
plasmid is shown in Figure 23. 

BRIEF DESCRIPTION OF THE FIGURES 
25 Figure 1 shows the original HTV-l gag adenovector (Ad5HIV-lgag). This 

vector is disclosed in PCT International Application No. PCT/USOO/1 8332 (WO 
01/02607) filed July 3, 2000, claiming priority to U.S. Provisional Application Serial 
No. 60/142,631, filed July 6, 1999 and U.S. AppUcation Serial No. 60/148,981, filed 
August 13, 1999, all three applications which are hereby incorporated by reference. 
30 Figure 2 shows the nucleic acid sequence (SEQ ID NO: 29) of the optimized 

. human HIV-1 gag open reading frame. . . 

Figure 3 shows diagranmiatically the new transgene constructs in comparison 
with the original gag transgene. 

Figure 4 shows the modifications made to the original adenovector backbone 
35 in the generation of the novel vectors of the instant invention. 
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Figure 5 shows the virus mixing expeiiznents that were canied out to 
determine the effects of the addition made to the packaging signal region (Expt #1) 
and the E3 gene on viral growth (Expt. #2). The bars denote fhc region of 
modifications made to the El deletion. 

5 Figure 6 shows an autoradiograph of viral DNA analysis following the viral 

mixing experiments described in Examples 6 and 7. 

Figures 7A, 7B and 7C are as follows: Figure 7A shows the hCMV-Flgag- 
bGHpA adenovectors constructed within the MRKpAdHVE3 and MRKpAdHVO 
adenovector backbones. Both El parallel and El antiparallel transgene orientation are 

10 represented. Figure 7B shows the hCMV-Flgag-SPA adenovectors constructed within 
the MRKpAdHVE3 and MRKpAdHVO adenovector backbones. Again, both El 
parallel and El antiparallel transgene orientation are represented. Figure 7C shows 
the mCMV-Flgag-bGHpA adenovectors constructed within the MRKpAdHVE3 and 
MRKpAdHVO adenovector backbones. Once again, both El parallel and El 

IS antiparallel transgene orientation are represented. 

Figure 8A shows the experiment designed to test the effect of transgene 
r^entation. 

Figure 8B shows the experiments designed to test the effect of 
polyadenylation signal. 
20 Figure 9 shows viral DNA from the four adenoviral vectors tested (Example 

12) at P5, following 5^rEll digestion. 

Figure 10 shows viral DNA analysis of passages 11 and 12 of 
MRKpAdHVES, MRKAdSHTV-lgag, and MRKAdSfflV-lgagES-. 

Figure 11 shows viral DNA analysis (Hindni digestion) of passage 6 
2S MRKpAdHVE3 and MKKAdSHIV-lgag used to initiate the viral competition study. 
The last two lanes are passage 11 analysis of duplicate passages of the competition 
study (each virus at MOI of 280 viral particles). 

Figure 12 shows viral DNA analysis by Hind HI digestion on high passage 
numbers for MRKAdSHIV-lgag in serum-containing media with collections made at 
30 specified times. The first lane shows the Ikb DNA size marker. The other lanes 
represent pre-plasmid control (digested with Pacl and Hindin), MRKAd5HIV-lgag 
atP16,P19, andP2L 

Figure 13 shows serum anti-p24 levels at 3 wks post i jn. immunization of 
balb/c mice (n=:10) with varying doses of several Adgag constructs: (A) MRK Ad5 
35 HIV-1 gag (through passage 5); (B) MRKAdS hCMV-FLgag-bGHpA (E3-); (C) 
MRKAdS hCMV-FLgag-SPA (E3+); (D) MRKAdS mCMV-FLgag-bGHpA (E3+); 
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(E) research lot (293 cell-derived) of AdSHTV-l gag; and (F) clinical lot 
(AdSgagFNOOOl) of Ad5HIV-l gag. Reported are the geometric mean titers (GMT) 
for each cohort along with the standard error bars. 

Figure 14 shows a restriction map of the pMRKAd5HIV-lgag vector. 

5 Figures 15A-X illustrates the nucleotide sequence of the pMRKlAdSEDV-lgag 

vector (SEQ ID NO:27, [coding] and SEQ ID NO:28 [non^oding]). 

Figures 16A-B shows a schematic representation of DNA vaccine expression 
vectors VI Jns (A) and VI Jns-tPA (B), which are utilized for HTV-l gag, pol and nef 
constructs in various DNA/viral vector combined modality regimens as disclosed 

10 herein. 

Figures 17A-C shows the nucleotide (SEQ ID NO:3) and amino acid sequence 
(SEQ ID N0:4) of lA-PoL Underlined codons and amino acids denote mutations, as 
listed in Table 1. 

Figure 18 shows codon optimized nucleotide and amino acid sequences 
1 5 through the fusion junction of tP A-pol inact(opt) (contained within SEQ ID NOs: 7 
and 8, respectively). The underlined portion represents the NH2-terminal region of 
lA-PoL 

Figures 19A-B show a nucleotide sequence comparison between wild type 
nef(jrfl) and codon optimized nef. The wild type nef gene from the jrfl isolate 
20 consists of 648 nucleotides capable of encoding a 216 amino acid polypeptide. WT, 
wild type sequence (SEQ ID NO: 19); opt, codon-optimized sequence (contained 
within SEQ ID NO:l). The Nef amino acid sequence is shown in one-letter code 
(SEQIDN0:2). 

Figures 20A-C show nucleotide sequences at junctions between nef coding 
25 sequence and plasmid backbone of nef expression vectors VI Jns/nef (Figure 20A)» 

VlJns/nef(G2A,LJLAA) (Figure 208), VlJns/tpanef OFigure 20C) and 

VI Jns/tpanef(LLAA) (Figure 20C, also). 5' and 3' flanking sequences of codon 

optimized nef or codon optimized nef mutant genes are indicated by bold/italic letters; 

nef and nef mutant coding sequences are indicated by plain letters. Also indicated (as 
30 underlined) are the restriction endonuclease sites involved in constmction of 

respective nef expression vectors, VlJns/tpanef and VlJns/tpanef(LLAA) have 

identical sequences at the junctions. 

Figure 21 shows a schematic presentation of nef and nef derivatives. Amino 

acid residues involved in Nef derivatives are presented. Glycine 2 and Leucinel74 
35 and 175 are the sites involved in myristylation and dileucine motif, respectively. For 

both versions of the tpanef fusion genes, the putative leader peptide cleavage sites are 



19 



wo 02/022080 



PCT/USOl/28861 



indicated with "**', and a exogenous serine residue introduced during the construction 
of the mutants is underlined. 

Figure 22 shows diagranunatically the construction of the pre-adenovirus 
plasmid construct, MRKAdSPol. 
5 Figure 23 shows diagrammatically the construction of die pre-adenovirus 

plasmid construct, MRKAdSNef . 

Figure 24 shows a comparison of clade B vs. clade C anti-gag T cell responses 
in clade B HIV-infected subjects. 

Figure 25 shows a comparison of clade B vs. clade C anti-nef T cell responses 
10 in clade B HIV-infected subjects. 

' Figures 26A-AO illustrates the nucleotide sequence of the pMRKAdSHTV- 

Ipol adenoviral vector (SEQ ED NO:32 [coding] and SEQ ID NO:33 [non-coding]), 
comprising the coding region of the inactivated pol gene (SEQ ID N03). 

Figures 27 A-AM illustrates the nucleotide sequence of the pMRKAd5HIV-l 
15 nef adenoviral vector (SEQ ID NO:34 [coding] and SEQ ID NO:35 [non-coding]), 
comprising the coding region of the inactivated pol gene (SEQ ID N013). 

Figure 28 shows the stability of MRKAdS vectors comprising various 
promoter fragments (hCMV or mCMV) and terminations signals (bGH or SPA) in 
E3(+) or E3(-) backbones. 
20 Figures 29A and B shows the anion-exchange HPLC viral particle 

concentrations of the freeze-thaw recovered cell associated virus at the 24, 36, 48, and 
60 hpi time points (Figure 29 A) and the timcourse QPA supernatant titers (Figure 
29B) for MRKAd5gag, MRKAdSpol and MRKAdSnef . 

Figure 30 shows the nucleotide sequence (SEQ ID NO:36) and amino acid 
25 sequence (SEQ ID NO:37) comprising the open reading frame of a representative 
fPA-gag fusion for use in the DNA and/or adenoviral vaccine disclosed herein. 

Figure 31 shows the intracellular ylFN staining of PBMCs collected at week 
10 (post DNA prime) and week 30 (post Ad boost). The cells were stimulated 
overnight in the presence or absence of the gag peptide pool. They were subsequently 
30 stained using fluorescence-tagged anti-CD3, anti-CD8, anti-CD4, and anti-ylFN 
. monoclonal antibodies. Eachplot.showsallCD3+TceDs which were segregated in . 
terms of positive staining for surface CDS and ylFN production. The nmnbers in the 
upper right and lower right quadrants of each plot are the percentages of CD3* cells 
that were CDS^yIPN^ CD4"^YlFN^, respectively. 
35 Figure 32 shows a comparison of single-modality adenovirus immunization 

with DNA + adjuvant prime/adenovirus boost immunization. 

20 
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Figures 33A-B show the nucleotide sequence (SEQ ID NO: 38) of the open 
reading frame for the gag-IApol fusion of Example 29. 

Figures 34A-B show the protein sequence (SBQ ID NO:39) of the gag-IApol 
fustion frame. 

5 

DETAILED DESCRIPTION OF THE INVENTION 

A novel replication-defective, or "first generation," adenoviral vector suitable 
for use in gene therapy or nucleotide-based vaccine vectors is described. IWs vector 
is at least partially deleted in El and comprises a wildtype adenovirus cij-acting 

10 packaging region from about base pair 1 to between about base pair 342 (more 
preferably, 400) to about 458 (preferably, 1-450) and, preferably, 3511-3523 of a 
wUd-type adenovirus sequence. It has been found that a vector of this description 
possesses enhanced growth characteristics, with approximately 5-10 fold greater 
amplification rates, and is more potent allowing lower doses of virus to be used to 

IS generate equivalent immunity. The vector, furthermore, generates a harvested 
recombinant adenovirus which shows greater cellular-mediated inmiune responses 
than replication-deficient vectors not comprising this region (basepairs 342-450). 
Adenoviral constructs derived from these vectors are, further, very stable genetically, 
particularly those comprising a transgene under the control of a hCMV promoter 

20 devoid of intron A. Viruses in accordance with this description were passaged 

continually and analyzed; see Example 12. Each virus analyzed maintained it correct 
genetic structure. Analysis was also carried out under propagation conditions similar 
to that performed in large scale production. Again, the vectors were found to possess 
enhanced genetic stability; see Figure 12. Following 21 passages, the viral DNA 

25 showed no evidence of rearrangement, and was highly reproducible from one 
production lot to the next. The outcome of all relevant tests indicate that the 
adenoviral vector is extremely well suited for large-scale production of recombinant, 
replication-deficient adenovirus, as shown herein with the data associated with Figure 
28. 

30 A preferred adenoviral vector in accordance with this description is a vector 

comprising basepairs 1-450, which is deleted in E3. This vector can accommodate up 
to approximately 7,500 base pairs of foreign DNA inserts (or exogenous genetic 
material). Another preferred vector is one retaining E3 which comprises basepairs 1- 
450. A preferred vector of this description is an E3+ vector comprising basepairs 1- 

35 450 and 351 1-3523. This vector, when deleted of the region spanning basepairs 451- 
3510, can accommodate up to approximately, 4,850 base pairs of foreign DNA inserts 
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(or exogenous genetic material). The cloning capacities of the above vectors have 
been determined using 105% of the wildtype AdS sequence as the upper genome size 
limit 

Wildtype adenovirus serotype 5 is used as the basis for the specific basepair 

5 numbers provided throughout the specification. The wildtype adenovirus serotype 5 
sequence is known and described in the art; see, Chroboczek et al, 1992 /. Virology 
186:280, which is hereby incorporated by reference. Accordingly, a particular 
embodiment of the instant invention is a vector based on the adenovirus serotype 5 
sequence. One of skill in the art can readily identify the above regions in other 

10 adenovirus serotypes (e.g., serotypes 2, 4, 6, 12, 16, 17, 24, 31. 33, and 42), regions 
defined by basepairs conesponding to the above basepair positions given for 
adenovirus serotype 5. Accordingly, the instant invention encompasses all adenoviral 
vectors partially deleted in El comprising basepairs corresponding to 1-450 
(particularly, 342-450) and, preferably, 351 1-3523 of a wild-type adenovirus serotype 

15 5 (AdS) nucleic acid sequence. Particularly preferred embodiments of the instant 
invention are those derived from adenoviruses like AdS which are classified in 
subgroup C (e.g., Ad2). 

Vectors in accordance with the instant invention are at least partially deleted in 
El. Preferably the El region is completely deleted or inactivated. Most preferably, 

20 the region deleted of El is within basepairs 451-3510. It is to be noted that the 

extended 5* and 3' regions of the disclosed vectors are believed to effectively reduce 
the size of the El deletion of previous constructs without overlapping any part of the 
El A/EIB gene present in the cell line used, i.e., the PER.C6® cell line transefected 
with base pairs 4S9-3510. Overlap of adenoviral sequences is avoided because of the 

25 possibility of recombination. One of ordinary skill in the art can certainly appreciate 
that the instant invention can, therefore, be modified if a different cell line transfected 
wifli a different segment of adenovirus DNA is utilized. For purposes of 
exemplification, a 5* region of base pairs 1 to up to 449 is more appropriate if a cell 
line is transfected with adenoviral sequence from base pairs 450-3510, This holds 

30 tme as well in the consideration of segments 3' to the El deletion. 

Preferred embodiments of the instant invention possess an intact E3 region . 
(i.e., an E3 gene capable of encoding a functional E3). Alternate embodiments have a 
partially deleted E3, an inactivated E3 region, or a sequence completely deleted of E3. 
Applicants have found, in accordance with the instant invention, that virus comprising 

35 the E3 gene were able to amplify more rapidly compared with virus not comprising an 
E3 gene; see Figure 6 wherein a diagnostic CsCl band conesponding to flie E3+ virus 



22 



wo 02/022080 



PCTAJSOl/28861 



tested (5,665 bp) was present in greater amount compared with the diagnostic band of 
3,010 bp correspondmg to the E3- vims. These results were obtained following a 
virus competition study involvmg mixing equal MOI ratio (1:1) of adenovectors both 
comprising the E3 gene and not comprising the E3 gene. This increased amplification 

5 capacity of the E3+ adenovectors was subsequently confirmed with growth studies; 
see Table 4A, wherein the E3+ virus exhibit amplification ratios of 470, 420 and 320 
as compared with the 115 and 40-50 of the E3- constructs. 

As stated above, vectors in accordance with the instant invention can 
accommodate up to approximately 4,850 base pairs of exogenous genetic material for 

10 an E3+ vector and approximately 7,500 base pairs for an E3- vector. Preferably, the 
insert brings the adenoviral vector as close as possible to a wild-type genomic size 
(e.g., for Ad5. 35,935 basepairs). It is well known that adenovirus amplifies best 
when they are close to their wild-type genomic size. 

The genetic material can be inserted in an El-parallel or an El anti-parallel 

15 orientation, as such is illustrated in Figure 7A. 7B, 7C and Figure 8A. Particularly 
preferred embodiments of the instant invention, have the insert in an El-parallel 
-xientation. Applicants have found, via competition experiments with plasmids 
containing transgenes in differing orientation (Figure 8A), that vector constructs with 
the foreign DNA insert in an El-parallel orientation amplify better and actually out- 

20 compete El-antiparallel-oriented transgenes. Viral DNA analysis of the mixtures at 
passage 3 and certainly at passage 6, showed a greater ratio of the vims carrying the 
transgene in the El parallel orientation as compared with the El anti-parallel version. 
By passage 10, the only viral species observed was the adenovector with the transgene 
in the El parallel orientation for both transgenes tested. 

25 Adenoviral vectors in accordance with the mstant invention are particulariy 

well suited to effectuate expression of desired proteins, one example of which is an 
HIV protein, particularly an HIV fidl length gag protein. Exogenous genetic material 
encoding a protein of interest can exist in the form of an expression cassette. A gene 
expression cassette preferably comprises (a) a nucleic acid encoding a protein of 

30 interest; (b) a heterologous promoter operatively linked to the nucleic acid encoding 
the protein; and (c) a transcription terminator. 

The transcriptional promoter is preferably recognized by an eukaryotic RNA 
polymerase. In a preferred embodiment, the promoter is a "strong" or "efficient" 
promoter. An example of a strong promoter is the immediate early human 

35 cytomegalovirus promoter (Chapman et al, 199 1 NucL Acids 19:3979-3986, which 
is incorporated by reference), preferably without intronic sequences. Most preferred 
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for use within the instant adenovind vector is a human CMV promoter without 
intronic seqeunces, like intron A. Applicants have found that intron A> a portion of 
the human cytomegalovirus promoter (hCMV), constitutes a region of instability for 
adenoviral vectors. CMV without intron A has been found to effectuate (Examples 1- 

5 3) comparable expression capabilities in vitro when driving HTV gag expression and, 
furthermore, behaved equivalently to intron A-containing constructs in Balb/c mice m 
vivo with respect to their antibody and T-cell responses at both dosages of plasmid 
DNA tested (20 jig and 200 ^g). Those skilled in the art will appreciate that any of a 
number of other known promoters, such as the strong immunoglobulin, or other 

10 eukaryotic gene promoters may also be used, including the EFl alpha promoter, the 
murine CMV promoter, Rous sarcoma virus (RS V) promoter, SV40 early/late 
promoters and the beta-actin promoter. 

In preferred embodiments, the promoter may also comprise a regulatable 
sequence such as the Tet operator sequence. This would be extremely useful, for 

IS example, in cases where the gene products are effecting a result other than that 
desired and repression is sought 

Preferred transcription termination sequences present within the gene 
expression cassette are the bovine growth hormone terminator/polyadenylation signal 
(bGHpA) and the short syntiietic polyA signal (SPA) of 50 nucleotides in length, 

20 defined as follows: AATAAAAGATCTTTATTTTCATTAGATCTGTGTGrTGGT- 
TmTGTGTG (SEQ ID NO:26). 

The combination of the CMV promoter (devoid of the intron A region) with 
the BOH terminator is particularly preferred although other promoter/t^minator 
combinations in the context of FG adenovirus may also be used. 

25 Other embodiments incorporate a leader or signal peptide into the transgene. 

A preferred leader is that from the tissue-specific plasminogen activator protein, tPA. 
Examples include but are not limited to the various tPA-gag, tPA-pol and tPA-nef 
adenovirus-based vaccines disclosed throughout this specification. 

In view of the improved adenovirus vectors described herein, an essential 

30 portion of the present invention are adenoviral-based HTV vaccines comprising said 
adenovims backbones which may be administered to a mammalian host, preferably a 
human host, in either a prophylactic or therapeutic setting. The HTV vaccines of the 
present invention, whether administered alone or in combination regimens with other 
viral- or non-viral-based DNA vaccines, should elicit potent and broad cellular 

35 immune responses against HIV that will either lessen the likelihood of persistent virus 
infection and/or lead to the establishment of a clinically significant lowered virus load 
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subject to HIV infection or in combination with HAART therapy, mitigate the effects 
of previously established HTV infection (antiviral immunotherapy(ARI)). While any 
HTV antigen (e.g., gag, pol, nef, gpl60, gp41, gpl20, tat, rev, etc.) may be utilized in 
the herein described recombinant adenoviral vectors, preferred embodiments include 

5 the codon optimized p55 gag antigen (herein exemplified as MRKAdSgag), pol and 
nef. Sequences based on different Clades of HIV-1 are suitable for use in the instant 
invention, most preferred of which are Clade B and Clade C. Particularly preferred 
embodiments are those sequences (especially, codon-optimized sequences) based on 
concensus Clade B sequences. Preferred versions of the MRKAdSpol and 

10 MRKAdSnef series of adenoviral vaccines will encode modified versions of pol or 
nef, as discussed herein. Preferred embodiments of the MRKAd5HIV-l vectors 
carrying HTV envelope genes and modifications thereof comprise the HIV codon- 
optimized env sequences of PCT International Applications PCTAJS97/02294 and 
PCTAJS97/10517, pubUshed August 28, 1997 (WO 97/31115) and December 24, 

15 1997, respectively; both documents of which are hereby incorporated by reference. 

A most preferred aspect of the instant invention is the disclosed use of the 
adenoviral vector described above to effectuate expression of HTV gag. Sequences 
for many genes of many HIV strains are publicly available in GENB ANK and 
primary, field isolates of HIV are available from the National Institute of Allergy and 

20 Infectious EHseases (NIAID) which has contracted with Quality Biological 

(Gaiihersburg, MD) to make these strains available. Strains are also available fix>m 
the World Health Organization (WHO), Geneva Switzerland. It is preferred that the 
gag gene be from an HIV-1 strain (CAM-1; Myers et al, eds. "Human Retroviruses 
and AIDS: 1995, IIA3-IIA19, which is hereby incorporated by reference). This gene 

25 closely resembles the consensus amino acid sequence for the clade B (North 

American/European) sequence. Therefore, it is within the purview of the skilled 
artisan to choose an appropriate nucleotide sequence which encodes a specific HIV 
gag antigen, or immunologically relevant portion thereof. As shown in Example 25, a 
clade B or clade C based p55 gag antigen will potentially be useful on a global scale. 

30 As noted herein, the transgene of choice for insertion in to a DNA or MRKAd-based 
adenoviral vector of the present invention is a codoji optimized version of p55 gag. . 
Such a MRKAdSgag adenoviral vector is documented in Example 11 and is at least 
referred to herein as MRKAd5HIV-lgag. Of course, additional versions are 
contemplated, including but not limited to modifications such as promoter (e.g., 

35 mCMV for hCMV) and/or pA-terminations signal (SPA for bGH) switching, as well 
as generating MRK Ad5 backbones with or without deletion of the Ad5 E3 gene. 
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The present invention also relates a series of MRKAdSpol-based adenoviral 
vaccines which are shown herein to generate cellular immune responses subsequent to 
administration in mice and non-human primate studies. Several of the MRKAdSpol 
series are exemplified herein. One such adenoviral vector is referred to as 
5 MRKAd5hCMV-inact opt pol(E3+), which comprises the MRKAdS backbone, the 
hCMV promoter (no intron A), an inactivated pol transgene, and contains the Ad5 E3 
gene in the adenoviral backbone. A second exemplified pre-adenovirus plasmid and 
concomitant virus is referred to as MRKAdShCMV-inact opt pol(E3-), which is 
identical to the former adenoviral vector except that ttie E3 is deleted Both 

10 constructions contain a codon optimized, inactivated version of HTV-l Pol, wherein at 
least the entire coding region is disclosed herein as SEQ ID NO:3 and the expressed 
protein is shown as SEQ ID N0:4 (see also Figure 17A-C and Table 1, which show 
targeted deletion for inactivated poL This and other preferred codon optimized 
versions of HTV Pol as disclosed herein are essentially as described in U.S. 

15 Application Serial No. 09/745,221, filed December 21, 2000 and PCT International 
AppUcation PCT/USOO/34724, also filed December 21, 2000, both documents which 
are hereby incorporated by reference. As disclosed in the above-mentioned 
documents, the open reading frame for these codon-optimized HIV-1 Pol-based DNA 
vaccines are represented by codon optimized DNA molecules encoding codon 

20 optimized HIV-l Pol (e.g. SEQ ID NO:2), codon optimized HIV-1 Pol fused to an 
amino terminal localized leader sequence (e.g. SEQ ID N0:6), and especially 
preferable, and exemplified by the MRKAd5-Pol constract in e.g.. Example 19, 
biologically inactivated pol ("inact opt Pol"; e.g., SEQ ID NO:4) which is devoid of 
significant PR, RT, RNase or IN activity associated with wild type Pol. In addition, a 

25 construct related to SEQ ID NO:4 is contemplated which coiitains a leader peptide at 
the amino terminal region of the lA Pol protein. A specific construct is ligated within 
an appropriate DNA plasmid vector containing regulatory regions operatively linked 
to the respective HEV-l Pol coding region, with or without a nucleotide sequence 
encoding a functional leader peptide. To this end, various HTV-l Pol constructs 

30 disclosed herein relate to open reading frames for cloning to the enhanced first 
generation Ad vectors of the present invention (such a series of MRKAdSpol 
adenoviral vaccine vectors), including but not limited to wild type Pol (comprising the 
DNA molecule encoding WT opt Pol, as set forth in SEQ ID NO:2), tPA-opt WTPol, 
(comprising the DNA molecule encoding tPA Pol, as set forth in SEQ ID N0:6), inact 

35 opt Pol (comprising the DNA molecule encoding lA Pol, as set forth in SEQ ID 

NO:4), and tPA-inact opt Pol, (comprising the DNA molecule encoding tPA-inact opt 
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Pol, as set forth in SEQ ID NO:8). The pol-based versions of enhanced first 
generation adenovirus vaccines elicit CTL and Th cellular immune responses upon 
administration to the host, including primates and especially humans. As noted in the 
above, an effect of the cellular inamune-directed vaccines of the present invention 
5 should be a lower transmission rate to previously uninfected individuals and/or 

reduction in the levels of the viral loads within an infected individual, so as to prolong 
the asymptomatic phase of HIV-1 infection. 

The present invention further relates to a series of MRKAdSnef-based 
adenoviral vaccines which, similar to HIV gag and pol antigens, generate cellular 
10 immune responses subsequent to administration in mice and non-human primate 
studies. The MRKAdSnef series are exemplified herein by utilizing the improved 
MKK adenoviral backbone in combination with modified versions of HTV nef . These 
exemplified MRKAdSnef vectors are as follows: (1) MRKAdShCMV- 
nef(G2A,LLAA) (E3+), which comprises the improved MRKAdS backbone, a human 
15 CMV promoter an intact Ad5 E3 gene and a modified nef gene: (2) MRKAdSmCMV- 
nef(G2A,LLAA) (E3+), which is the same as (1) above but substituting a murine 
CMV promoter for a human CMV promoter; and (3) MRKAdSmCMV-tpanef (LLAA) 
(E3+), which is the same as (2) except that the nef transgene is tpanefOJLAA). Codon 
optimized versions of fflV-l Nef and HIV-1 Nef modifications are essentially as 
20 described in U.S. AppUcation Serial No. 09/738,782, filed December 15, 2000 and 
PCT International Application PCT/USOO/34162, also filed December 15, 2000, both 
documents which are hereby incorporated by reference. Particular embodiments of 
codon optimized Nef and Nef modifications relate to a DNA molecule encoding HIV- 
1 Nef fi-om the HIV-1 jfrl isolate wherein the codons are optimized for expression in a 
25 mammalian system such as a human. The DNA molecule which encodes this protein 
is disclosed herein as SEQ ID NO:9, while the expressed open reading frame is 
disclosed. herein as SEQ ID NO:10. Another embodiment of Nef-based coding 
regions for use in the adenoviral vectors of the present invention comprise a codon 
optimized DNA molecule encoding a protein containing the human plasminogen 
30 activator (tpa) leader peptide fused with the NH2-terminus of the fflV-l Nef 

polypeptide. The DNA molecule which encodes this protein is disclosed herein as 
SEQ ID NO: 11, while the expressed open reading frame is disclosed herein as SEQ 
ID NO: 12. Another modified Nef optimized coding region relates to a DNA molecule 
encoding optimized HIV-1 Nef wherein the open reading firame codes for 
35 modifications at the amino terminal myristylation site (Gly-2 to Ala-2) and 

substitution of the Leu-174-Leu-175 dileucine motif to Ala-174-Ala-175, herein 
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described as opt nef (G2A, LLAA). The DNA molecule which encodes this protein is 
disclosed herein as SEQ ID NO: 13, while the expressed open reading frame is 
disclosed herein as SEQ ID NO: 14. MRKAdSnef vectors (1) MRKAdShCMV- 
nef(G2A4XAA) (E3+) and (2) MRKAd5niCMV-nef(G2AXLAA) (E3+) contain this 

5 transgene. An additional embodiment relates to a DNA molecule encoding optimized 
HTV-l Nef wherein the amino terminal myristylation site and dileucine motif have 
been deleted, as well as comprising a tPA leader peptide. This DNA molecule, opt 
tpanef (LLAA), comprises an open reading frame which encodes a Nef protein 
containing a tPA leader sequence fused to amino acid residue 6-216 of HTV-l Nef 

10 (jfrl), wherein Leu-174 and Leu-175 are substituted with Ala-174 and Ala-175, herein 
referred to as opt tpanef (LLAA) is disclosed herein as SEQ ID NO: 15, while the 
expressed open reading frame is disclosed herein as SEQ ID N0:16. The 
MRKAdSnef vector "MRKAd5mCMV-tpanef(LLAA) (E3+)" contains this transgene. 
Along with the improved MRKAdSgag adenovirus vaccine vector described 

15 herein, generation of a MRKAd5pol and MRKAdSnef adenovirus vector provide for 
enhanced HIV vaccine capabilities. Namely, the generation of this trio of adenoviral 
vaccine vectors, aU shown to generate effective cellular immune responses subsequent 
to host administration, provide for the ability to administer these vaccine candidates 
not only alone, but preferably as part of a divalent (i.e., gag and nef, gag and pol, or 

20 pol and nef components) or a trivalent vaccine (i.e., gag, pol and nef components). 
Therefore, a preferred aspect of the present invention are vaccine formulations and 
associated methods of administration and concomitant generation of host cellular 
immune responses associated with formulating three separate series of MRKAd5- 
based adenoviral vector vaccines. Of course, this MRKAdS vaccine series based on 

25 distinct HIV antigens promotes expanded opportunities for formulation of a divalent 
or trivalent vaccine, or possibly administration of separate formulations of one or 
more monovalent or divalent formulations within a reasonable window of time. It is 
also within the scope of the present invention to embark on combined modality 
regimes which include multiple but distinct components from a specific antigen. An 

30 example, but certainly not a limitation, would be separate MRKAdSpol vectors, with 
one vaccine vector expressing wild type Pol (SEQ ID N0:2) and another 
MRKAdSpol vector expressing inactivated Pol (SEQ ID N0:6), Another example 
might be separate MRKAdSnef vectors, with one vaccine vector expressing the 
tPA/LLAA version of Nef (SEQ ID NO: 16) and another MRKAdSnef vector 

35 expressmg the G2A,LLAA modified version of Nef (SEQ ID NO: 14). Therefore, the 
MRKAdS adenoviral vectors of the present invention may be used in combination 
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with multiple, distinct HIV antigen classes. Each HIV antigen class is subject to 
sequence manipulation, thus providing for a multitude of potential vaccine 
combinations; and such combinations are within the scope of the present invention. 
The utilization of such combined modalities vaccine formulation and administration 

5 increase the probability of eliciting an even more potent cellular immune response 
when compared to inoculation with a single modality regimen. 

The present invention also relates to application of a mono-, dual-, or tri- 
modality administration regime of the MRKAd5gag, pel and nef adenoviral vaccine 
series in a prime/boost vaccination schedule. This prime/boost schedule may include 

10 any reasonable combination of the MRKAdSgag, pol and nef adenoviral vaccine 

series disclosed herein. In addition, a primeAjoost regime may also involve other viral 
and/or non-viral DNA vaccina* A preferable addition to an adenoviral vaccine 
vector regime includes but is not limited to plasmid DNA vaccines, especially DNA 
plasmid vaccines that contain at least one of the codon optimized gag, pol and nef 

IS constructions, as disclosed herein. 

Therefore, one aspect of this invention is the administration of the adenoviral 
vector containing the optimized gag gene in a primeAjoost regiment in conjunction 
with a plasmid DNA encoding gag. To distinguish this plasmid from the adenoviral- 
containing shuttle plasmids used in the construction of an adenovirus vector, this 

20 plasmid will be referred to as a "vaccine plasmid" or "DNA plasmid vaccine". 
Preferred vaccine plasmids for use in this administration protocol are disclosed in 
pending U.S. patent appUcation 09/017,981, filed Febniary 3, 1998 and WO98/34640. 
published August 13, 1998, both of which are hereby incorporated by reference. 
Briefly, the preferred vaccine plasmid is designated VlJns-FLgag, which expresses 

25 the same codon-optimized gag gene as the adenoviral vectors of this invention (see 
Hgure 2 for the nucleotide sequence of the exemplified optimized codon version of 
full length p55 gag). The vaccine plasmid backbone, designated VI Jns contains the 
CMV immediate-early (IE) promoter and intron A, a bovine growth hormone-derived 
polyadenylation and transcription termination sequence as the gene expression 

30 regulatory elements, and a minimal pUC backbone; see Montgomery et al., 1993, 
DNA Cell Biol 12:777-783. Hie pUC sequence permits high levels of plasmid 
production in E. coli and has a neomycin resistance gene in place of an ampicillin 
resistance gene to provide selected growth in the presence of kanamycin. 
Alternatively, a vaccine plasmid which has the CMV promoter deleted of intron A can 

35 be used. Those of skill in the art will recognize that alternative vaccine plasmid 
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vectors may be easily substituted for these specific constructs, aiid this invention 
specifically envisions use of such alternative plasmid DNA vaccine vectors. 

Another aspect of the present invention is a prime/boost regimen which 
includes a vaccine plasmid which encodes an HTV pol antigen, preferably a codon 
5 optimized form of pol and also preferably a vaccine plasmid which comprises a 
nucleotide sequence which encodes a Pol antigen selected fix)m the group of Pol 
antigens as shown in SEQ ID NOs: 2, 4, 6 and 8, The variety of potential DNA 
plasmid vaccines which encode various biologically active fonns of HTV-l Pol, 
wherein administration, intracellular delivery and expression of the HTV-l Pol gene of 

10 interest elicits a host CTL and Th response. The preferred synthetic DNA molecules 
of the present invention encode codon optimized wild type Pol (without Pro activity) 
and various codon optimized inactivated HTV-l Pol proteins. The HIV-1 pol open 
reading disclosed herein are especially preferred for pharmaceutical uses, especially 
for human administration as delivered via a recombinant adenoviral vaccine, 

15 especially an enhanced first generation recombinant adenoviral vaccine as described 
herein. Several embodiments of this portion of the invention are provided in detail 
below, namely DNA molecules which comprise a HTV-l pol open reading frame, 
whether encoding full length pol or a modification or fusion as described herein, 
wherein the codon usage has been optimized for expression in a mammal, especially a 

20 human. Again, these DNA sequences are positioned appropriately within a 

recombinant adenoviral vector, such as the exemplified recombinant adenoviral vector 
described herein, so as to promote expression of the respective HIV-1 Pol gene of 
interest, and subsequent to administration, elicit a host CTL and Th response. Again, 
these preferred, but iii no way liroiting, pol genes are as disclosed herein and 

25 essentially as described in U.S . Application Serial No. 09/745,221, filed December 
21, 2000 and PCT International AppHcation PCT/USOO/34724, also filed December 
21, 2000, both documents which are hereby incorporated by reference. 

A third series of vaccine plasmids which are useful in a combined modality 
and/or prime/boost regimen are vaccine plasmids which encode an HTV nef antigen or 

30 biologically and/or immunologically relevant modification thereof. As noted 

elsewhere, preferred vaccine plasmids contain a codon optimized form of nef and also 
preferably comprise a nucleotide sequence which encodes a Nef antigen selected firom 
the group of Nef antigens as shown in SEQ ID NOs: 10, 12, 14 and 16. These 
preferred nef coding regions are disclosed herein, as well as being described in U.S. 

35 Application Serial No. 09/738,782, filed December 15, 2000 and PCT International 
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AppUcation PCT/USOO/34162, also filed December 15, 2000, both documents which 
aie hereby incorporated by reference. 

Therefore, the adenoviral vaccines and plasmid DNA vaccines of this 
invention may be administered alone, or may be part of a prime and boost 

5 administration regimen. A mixed modality priming and booster inoculation scheme 
will result in an enhanced immune response, particularly is pre-existing anti-vector 
immune responses are present. This one aspect of this invention is a method of 
priming a subject with the plasmid vaccine by administering the plasmid vaccine at 
least one time, allowing a predetermined length of time to pass, and then boosting by 

10 administering the adenoviral vaccine. Multiple primings typically, 1-4, are usually 
employed, although more may be used. The lengdi of time between priming and 
boost may typically vary from about four months to a year, but otiier time frames may 
be used. In experiments with rhesus monkeys, the animals were primed four times 
with plasmid vaccines, then were boosted 4 months later with the adenoviral vaccine. 

15 Their cellular immune response was notably higher than that of animals which had 
only received adenoviral vaccine. The use of a priming regimen may be particularly 
preferred in situations where a person has a pre-existing anti-adenovirus immune 
response. 

Furthennore and in the altemative, multiple HTV-l viral antigens, such as the 
20 MRKAdS adenoviral vaccines disclosed herein, may be ligated into a proper shuttle 
plasmid for generation of a pre-adenoviral plasmid comprising multiple open reading 
frames. For example a trivalent vector may comprise a gag-pol-nef fusion, in either a 
E3(-) or E3(+) background, preferably a E3 deleted backbone, or possible a "2+1" 
divalent vaccine, such as a gag-pol fusion (i.e., codon optimized p55 gag and 

25 inactivated optimized pol; Example 29 and Table 25) within the same MRKAdS 
backbone, with each open reading frame being operatively linked to a distinct 
promoter and transcription termination sequence. Alternatively, the two open reading 
frames may be operatively linked to a single promoter, with the open readmg frames 
operatively linked by an internal ribosome entry sequence (IRES), as disclosed in 

30 Intemational Publication No. WO 95/24485, which is hereby incorporated by 
reference. Figure 9 shows that the use of multiple promoters and termination 
sequences provide for similar growth properties, while Figure 28 shows that tiiese 
MRKAdSgag-based vectors are also stable at least tiirough passage 21. In tiie absence 
of tiie use of IRES-based technology, it is preferred that a distinct promoter be used to 

35 support each respective open reading frame, so as to best preserve vector stability. As 
examples, and certainly not as limitations, potential multiple transgene vaccines may 
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include a three transgene vector such as hCMV-gagpol-bGHpA + mCMV-nef-SPA in 
an E3 deleted backbone or hCMV-gagpol-bGHpA + mCMV.nef-SPA(E3+), 
Potential "2+1" divalent vaccines^of the present invention might be a hCMV-gag- 
bGHpA + mCMV-nef-SPA in an E3+ backbone (vector #1) in combination with 

5 hCMV-pol-bGHpA in an E3+ backbone (vector #2), with all transgenes in the El 
parallel orientation. Fusion constructs other than the gag-pol fusion described above 
are also suitable for use in various divalent vaccine strategies and can be composed of 
any two HIV antigens fused to one another (e.g.,, nef-pol and gag-nef). These 
adenoviral compositions are, as above, preferably delivered along with an adenoviral 

10 composition comprising an additional HTV antigen in order to diversify the immune 
response generated upon administration. Therefore, a multivalent vaccine delivered 
in a single, or possible second, adenoviral vector is certainly contemplated as part of 
the present invention. Again, this mode of administration is another example of 
whereby an efficaceous adenovims-based HTV-l vaccine may be administered via a 

15 combined modality regime. It is important to note, however, that in terms of deciding 
on an insert for the disclosed adenoviral vectors, due consideration must be dedicated 
to the effective packaging limitations of the adenovirus vehicle. Adenovirus has been 
shown to exhibit an upper cloning capacity limit of approximately 105% of the 
wildtype Ad5 sequence. 

20 Regardless of the gene chosen jfor expression, it is preferred that the sequence 

be "optimized" for expression in a human cellular environment. A "triplet" codon of 
four possible nucleotide bases can exist in 64 variant forms. That these fonns provide 
the message for only 20 different amino acids (as well as transcription initiation and 
termination) means that some amino acids can be coded for by more than one codon. 

25 Indeed, some amino acids have as many as six "redundant", alternative codons while 
some others have a single, required codon. For reasons not compietely understood, 
alternative codons are not at all uiuforaily present in the endogenous DNA of 
differing types of cells and there appears to exist variable natural hierarchy or 
"preference" for certain codons in certain types of cells. As one example, the amino 

30 acid leucine is specified by any of six DNA codons including CTA, CTC, CTG, CTT, 
TTA, and TTG (which correspond, respectively, to the mRNA codons, CUA, CUC, 
CUG, CUU, UUA and UUG). Exhaustive analysis of genome codon frequencies for 
microorganisms has revealed endogenous DNA of E. coli most commonly contains 
die CTG leucine-specifying codon, while the DNA of yeasts and slime molds most 

35 conmionly includes a TTA leucine-specifying codon. In view of this hierarchy, it is 
generally held that the likelihood of obtaining high levels of expression of a leucine- 
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rich polypeptide by an E. coli host wiU depend to some extent on the ftequency of 
codon use. For example, a gene rich in TTA codons will in all probaWlity be poorly 
expressed in E. coli. whereas a CTG rich gene wiU probably highly express the 
polypeptide. Similarly, when yeast cells are the projected transformation host cells 
5 for expression of a leucine-rich polypeptide, a preferred codon for use in an inserted 
DNA would be TTA. 

The implications of codon preference phenomena on recombinant DNA 
techniques are manifest, and the phenomenon may serve to explain many prior 
failures to achieve high expression levels of exogenous genes in successfully 
10 transforaied host organisms-a less "preferred" codon may be repeatedly present in the 
inserted gene and the host ceU machinery for expression may not operate as 
efficiently. This phenomenon suggests that synthetic genes which have been designed 
to include a projected host ctWs preferred codons provide a preferred form of foreign 
genetic material for practice of recombinant DNA techniques. Thus, one aspect of 
15 this invention is an adenovirus vector or adenovirus vector in some combination with 
a vaccine plasmid where both specifically include a gene which is codon optimized 
for expression in a human ceUular environment. As noted herein, a preferred gene 
for use in the instant invention is a codon-optimized HIV gene and, particularly. HIV 
gag, pol or nef . 

20 Adenoviral vectors in accordance with tiie instant invention can be constructed 

using known techniques, such as those reviewed in Hitt et al. 1997 "Human 
Adenovirus Vectors for Gene Transfer into Mammalian Cells" Advances in 
Plmmacology 40:137-206. which is hereby incorporated by reference. 

In consttiicting die adenoviral vectors of this invention, it is often convenient 
25 to insert them into a plasmid or shutUe vector. These techniques are known and 
described in Hitt et al., suprcu This invention specificaUy includes both the 
adenoviras and the adenovirus when inserted into a shuttle plasmid. 

Preferred shuttle vectors contain an adenoviral portion and a plasmid portion. 
The adenoviral portion is essentially the same as the adenovirus vector discussed 
30 supra, containing adenoviral sequences (with non-functional or deleted El and E3 

regions) and the gene expression cassette, flanked by convenient restriction sites. The 
plasmid portion of the shutUe vector often contains an antibiotic resistance marker 
under transcriptional conti-ol of a prokaryotic promoter so that expression of tiie 
antibiotic does not occur in eukaryotic cells. Ampicillin resistance genes, neomycin 
35 resistance genes and other pharmaceutically acceptable antibiotic resistance markers 
may be used. To aid in die high level production of the polynucleotide by 
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feimentation in prokaryotic organisms, it is advantageous for the shuttle vector to 
contain a prokaryotic origin of replication and be of high copy number. A number of 
commercially available prokaryotic cloning vectors provide these benefits. It is 
desirable to remove non-essential DNA sequences. It is also desirable that the vectors 
5 not be able to replicate in eukaryotic cells. This minimizes the risk of integration of 
polynucleotide vaccine sequences into the recipients' genome. Tissue-specific 
promoters or enhancers may be used whenever it is desirable to limit expression of the 
polynucleotide to a particular tissue type. 

In one embodiment of this invention, the pre-plasmids (e.g., pMRKAdSpol, 

10 pMRKAdSnef and pMRKAdSgag were generated by homologous recombination 
using the MRKHVE3 (and MRKHVO for the E3- version) backbones and the 
appropriate shuttle vector, as shown for pMRKAdSpoI in Figure 22 and for 
pMRKAdSnef in Figure 23. The plasmid in linear form is capable of replication after 
entering the PER.C6® cells and virus is produced. The infected cells and media were 

15 harvested after viral replication was complete. 

Viral vectors can be propagated in various El complementing cell lines, 
including the known cell lines 293 and PER.C6®. Both these cell lines express the 
aaenoviral El gene product. PER.C6® is described in WO 97/00326 (published 
January 3, 1997) and issued U.S. Patent No. 6,033,908, both of which are hereby 

20 incorporated by reference. It is a primary human retinoblast cell line transduced with 
an El"^ejrie segment that complements the production of replication deficient (FG) 
adenovirus, but is designed to prevent generation of replication competent adenovirus 
by homologous recombination. Cells of particular interest have been stably 
transformed with a transgene that encodes the AD5E1 A and ElB gene, like PER.C6®, 

25 from 459 bp to 3510 bp inclusive. 293 cells are described in Graham et al., 1977 /. 
GerL Virol 36:59-72, which is hereby incorporated by reference. As stated above, 
consideration must be given to the adenoviral sequences present in the 
complementing cell line used. It is important that the sequences not overlap with that 
present in the vector if the possibility of recombination is to be minimized. 

30 It has been found that vectors generated in accordance with the above 

description are more effective in inducing an inomune response and, thus, constitute 
very promising vaccine candidates. More particularly, it has been found that first 
generation adenoviral vectors in accordance with the above description carrying a 
codon-optimized HIV gag gene, regulated with a strong heterologous promoter can be 

35 used as human anti-HIV vaccines, and are capable of inducing inmiune responses. 
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Standard techniques of molecular biology for preparing and purifying DNA 
constructs enable the preparation of the DNA immunogens of this invention. 

A vaccine composition comprising an adenoviral vector in accordance Svith 
the instant invention may contain physiologically acceptable components, such as 

5 buffer, normal saline or phosphate buffered saline, sucrose, other salts and 

polysorbate. One preferred formulation has: 2.S-10 mM TRIS buffer, preferably 
about 5 mM TRIS buffer. 25-100 mM NaCl, preferably about 75 mM NaCl; 2.5-10% 
sucrose, preferably about 5% sucrose; 0.01 -2 mM MgCh; and 0.001%-0.01% 
polysorbate 80 (plant derived). The pH should range from about 7.0-9.0, preferably 

10 about 8.0. One skilled in the art will appreciate that other conventional vaccine 
excipients may also be used it make the formulation. The preferred formulation 
contains 5mM TRIS, 75 mM NaCl, 5% sucrose, ImMMgCh, 0.005% polysorbate 80 
at pH 8.0 This has a pH and divalent cation composition which is near the optimum 
for Ad5 stability and minimizes the potential for adsorption of virus to a glass surface. 

15 It does not cause tissue irritation upon intramuscular injection. It is preferably frozen 
until use. 

The amount of adenoviral particles in the vaccine composition to be 
introduced into a vaccine recipient will depend on the strength of the transcriptional 
and translational promoters used and on the immunogenicity of the expressed gene 

20 product. Tn general, an immunologically or prophylactically effective dose of 1x10^ 
to 1x10^^ particles and preferably about ixlO*^ to IxlO" particles is administered 
directly into muscle tissue. Subcutaneous injection^ intradermal introduction, 
impression through the skin, and other modes of administration such as 
intraperitoneal, intravenous, or inhalation delivery are also contemplated. It is also 

25 contemplated that booster vaccinations are to be provided. Following vaccination 

with HTV adenoviral vector, boosting with a subsequent HIV adenoviral vector and/or 
plasmid may be desirable. Parenteral administration, such as intravenous, 
intramuscular, subcutaneous or other means of administration of interleukin-12 
protein, concurrently with or subsequent to parenteral introduction of the vaccine 

30 compositions of this invention is also advantageous. 

The adenoviral vector and/or vaccine plasmids of this invention 
polynucleotide may be unassociated with any proteins, adjuvants or other agents 
which impact on the recipients' immune system. In this case, it is desirable for the 
vector to be in a physiologically acceptable solution, such as, but not limited to, sterile 

35 saline or sterile buffered saline. Alternatively, the vector may be associated with an 
adjuvant known in the art to boost immune responses (i.e., a ''biologically effective" 
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adjuvant), such as a protein or other carrier. Vaccine plasmids of this invention may, 
for instance, be delivered in saline (e.g., PBS) with or without an adjuvant Preferred 
adjuvants are Alum or CRL1005 Block Copolymer Agents which assist in the 
cellular uptake of DNA. such as, but not limited to, calcium ions, may also be used to 
5 advantage. These agents are generally referred to herein as transfection facilitatmg 
reagents and phannaceutically acceptable carriers. Techniques for coating 
microprojectiles coated with polynucleotide are known in the art and are also useful in 
connection with this invention. 

This invention also includes a prime and boost regimen wherein a first 

10 adenoviral vector is administered, then a booster dose is given. The booster dose may 
be repeated at selected time intervals. Alternatively, a preferred inoculation scheme 
comprises priming with a first adenovirus serotype and then boosting with a second 
adenovirus serotype. More preferably, the inoculation scheme comprises priming 
with a first adenovirus serotype and then boosting with a second adenovirus serotype, 

15 wherein the first and second adenovirus serotypes are classified within separate 

subgroups of adenoviruses. The above prime/boost schemes are particularly preferred 
in those situations where a preexisting immunity is identified to the adenoviral vector 
of choice» In this type of scheme, the individual or population of individuals is 
primed with an adenovirus of a serotype other than that to which the preexisting 

20 immunity is identified. This enables the first adenovirus to effectuate sufficient 

expression of the transgene while evading existing inmiunity to the second adenovirus 
(the boosting adenovirus) and, further, allows for the subsequent delivery of the 
transgene via the boosting adenovirus to be more effective. Adenovirus serotype 5 is 
one example of a virus to which such a scheme might be desirable. In accordance 

25 with this invention, therefore, one might decide to prime with a non-group C 

adenovirus (e.g., Adl2, a group A adenovirus, Ad24, a group D adenovirus, or Ad35, 
a group B adenovirus) to evade anti-AdS immunity and then boost with Ad5, a group 
C adenovirus. Another preferred embodiment involves administration of a different 
adenovirus (including non-human adenovirus) vaccine followed by administration of 

30 the adenoviral vaccines disclosed. In the alternative, a viral antigen of interest can be 
. first delivered via a viral vaccine other than an adenovirus-based vaccine, and then 
followed with the adenoviral vaccine disclosed. Alternative viral vaccines include but 
are not limited to pox virus and Venezuelan equine encephilitis virus. 

A large body of human and animal data supports the importance of cellular 

35 immune responses, especially CTL in controlling (or eliminating) HIV infection. In 
humans, very high levels of CTL develop following primary infection and correlate 
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with the control of viremia. Several small groups of individuals have been described 
who are repeatedly exposed to HIV by remain uninfected; CTL has been noted in 
several of these cohorts. In the SIV model of HIV infection, CTL similarly develops 
following primary infection, and it has been demonstrated that addition of anti-QD8 
monoclonal antibody abrogated this control of infection and leads to disease 
progression. This invention uses adenoviral vaccines alone or in combination with 
plasnoud vaccines to induce CTL. 

The following non-limiting Examples are presented to better illustrate the 
invention. 

EXAMPLE 1 

Removal of the Intron A Portion of the hCMV Promoter 
GMP grade pVUnsinVgag was used as the starting material to amplify the 
hCMV promoter. PVUnsHIVgag is a plasmid comprising the CMV immediate-early 
ODE) promoter and intron A, a full-length codon-optimized HIV gag gene, a bovine 
growth hormone-derived polyadenylation and transcriptional termination sequence, 
•*nd a minimal pUC backbone; see Montgomery et al, supra for a description of the 
plasmid backbone. The amplification was performed with primers suitably positioned 
to flank the hCMV promoter. A 5' primer was placed upstream of the Mscl site of 
the hCMV promoter and a 3' primer (designed to contain the BgZII recognition 
sequence) was placed 3' of the hCMV promoter. The resulting PGR product (using 
high fidelity Taq polymerase) which encompassed the entire hCMV promoter (minus 
intron A) was cloned into TOPO PGR blunt vector and then removed by double 
digestion with Mscl and BgZII. This fragment was then cloned back into the original 
CMP grade pVlJnsHIVgag plasmid from which the ori^nal promoter, intron A, and 
the gag gene were removed following Mscl and BgJSi digestion. This ligation 
reaction resulted in the construction of a hCMV promoter (minus intron A) + bGHpA 
expression cassette within the original pVlJnsHTVgag vector backbone. This vector 
is designated pVnnsCMV(no intron). 

The FLgag gene was excised from pVl JnsHTVgag using SgZH digestion and 
the 1,526 bp gene was gel purified and cloned into pVlJnsCMV(no intron) at the 
BgUL site. Colonies were screened using Snml restriction enzymes to identify clones 
that carried the Flgag gene in the correct orientation. This plasmid, designated 
pVl JnsCMV(no intron)-FLgag-bGHpA, was fully sequenced to confirm sequence 
integrity. 



37 



wo 02/022080 PCT/USO 1/28861 



Two additional transgenes were also constructed. The plasnaid, 
pVlJnsCMV(no intron)-FLgag-SPA, is identical to pVlJnsCMV(no intron>FLgag- 
bGHpA except that the bovine growth honnone polyadenylation signal has been 
replaced with a short synthetic polyA signal (SPA) of 50 nucleotides in length. The 
5 sequence of the SPA is as shown, with the essential components (poly(A) site, (QT)n, 
and (T)n; respectively) underlined: 

AATA AA AGATCTTTATrrrCATTAGATC TGTGTG TTGGTTTTTTGTGTG 
(SEQIDNO:18). 

The plasmid, pVl Jns-mCMV-FLgag-bGHpA, is identical to the 
10 pV 1 JnsCMV(no intron)-ELgag-bGHpA except that the hCMV promoter has been 
removed and replaced with the murine CMV (mCMV) promoter. 

Figure 3 diagranmiatically shows the new transgene constructs in comparison 
with the original transgene* 

15 EXAMPI£2 

Gag Expression Assay for Modified Gag Transgenes 
Gag Elisa was performed on culture supematants obtained from transient 
tissue culture transfection experiments in which the two new hCMV-containing 
plasmid constructs, pVlJnsCMV(no intron)-FLgag-bGHpA and pVlJnsCMV(no 

20 intron)-FLgag-SPA, both devoid of intron A, were compared to pVl JnsHIVgag 

which, as noted above possesses the intron A as part of the hCMV promoter. Table 2 
below shows the in vitro gag expression data of the new gag plasmids compared with 
the GMP grade original plasmid. The results displayed in Table 2 show that both of 
the new hCMV gag plasmid constructs have expression capacities comparable to the 

25 original plasmid construct which contains the intron A portion of the hCMV 
promoter. 
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Table 2: In vitro DNA transfection of original and new plasmid HIV-1 gag constmcts. 



Plasmid 


tiR KaR/10e6 COS cells/Sus DNA/48 hr 


HIVFL-EaRPR9901' 


10.8 


PVUns-hCMV-FLEag-bGHoA*' 


16.6 


pVlIns-hCMV-FLeaB-SPA'* 


12.0 



*GMP grade pVlJns-hCMVintronA-FLgag-bGHpA. 

^ New plasmid constructions that have the intron A portion removed from the hCMV 
promoter. 

In this constmct the bGH terminator has been replaced with the short synthetic 
polyadenylation signal (SPA) 



EXAMPLES 
Rodent (Balb/c) Study for Modified gag Transgenes 
A rodent study was performed on the two new plasmid constmcts 
^escribed above - pVlJnsCMV(no intron>FLgag-bGHpA and pVlJnsCMV(no 
intron)-FLgag-SPA - in order to compare them with the constmct described above 
possessing the intron A portion of the CMV promoter. pVlJnsHIVgag. Gag antibody 
and Elispot responses (described in PCT International Application No. 
PCT/USOO/18332 (WO 01/02607) filed July 3, 2000, claiming priority to U.S. 
Provisional Application Serial No. 60/142,631, filed July 6, 1999 and U.S. 
Application Serial No. 60/148,981, filed August 13, 1999, all three applications which 
are heieby incorporated by reference) were measured. The results displayed in Table 
3 below, show that the new plasmid constructs behaved equivalently to the original 
. construct in Balb/c mice with respect to their antibody and T-cell responses at both 
dosages of plasmid DNA tested. 20 \ig and 200 \ig. 
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EXAMPUE4 

Table 3: HIV1 91: Immunogeniclty of VUns-gag under different promoter and tennination control 
elements. 



DNA' 


Dose, 
ug** 




Anti-p24 Titers 
(SWkPDI)^ 


SFC/IO'^e Cells 
(4WkPD1)*' 


PromoterAermlnator 




GMT 


+SE 


-SE 


Media 


gag197-205 


p24 


HIVFL-gagPR9901 
(GMP grade) 


200 
20 


12800 
5572 


4652 
1574 


3412 
1227 


2(2) 
0 


129(19) 
56(9) 


30(11) 
25(6) 


pVUns-hCMV- 
FL-gag-bGHpA 


200 
20 


11143 
7352 


2831 
2808 


2257 
2032 


0 
0 


98(5) 
73(9) 


12(6) 
11(6) 


pVIJns-hCMV* 
FL-gag-SPA 


200 
20 


16890 
5971 


5815 
5361 


4326 
2825 


1(1) 
0 


94(4) 
' 85(17) 


26(7) 
38(10) 


Narve 


0 


123 


50 


36 


0 


0 


0 



um. Injections into both quads, 50 per quad 
*^nBl0;GMT, geometric mean titan SE, standard, error 

''n^^, pooled spleens; mean of triplicate wells and standard, deviation. In parentheses; 

Construction of the Modified Shuttle Vector -**MRKpdelEl Shuttle" 
The modifications to the original Ad5 shutfle vector (pdelElspl A; a vector 
comprising Ad5 sequences fi"om basepairs 1-341 and 3524-5798, with a multiple 
5 cloning region between nucleotides 341 and 3524 of Ad5, included the following 
three manipulations earned out in sequential cloning steps as follows: 
(1) The left TTR region was extended to include the Pacl site at the junction between 
the vector backbone and the adenovirus left ITR sequences. This allow for easier 
manipulations using the bacterial homologous recombination system. 
10 (2) The packaging region was extended to include sequences of the wild-type (WT) 
adenovirus from 342 bp to 450 bp inclusive. 

(3) The area downstream of pK was extended 13 nucleotides (i.e., nucleotides 3511- 
3523 inclusive). 

These modifications (Figure 4) effectively reduced flie size of the El deletion without 
15 overlapping with any part of tiie ElA/ElB gene present in the tiansfonned PER.C6® 
cell line. .All manipulations were performed by modifying the Ad shuttle vector 
pdelElsplA. 

Once the modifications were made to the shuttle vector, the changes were 
incorporated into the original Ad5 adenovector backbones (pAdHVO and pAdHVE3) 
20 by bacterial homologous recombination using E. coli B J5 1 83 chemically competent 
cells. 
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EXAMPLES 

rrtnstiuction of Modified Adenovector Backb ones (E3+ and ESQ 
The original adenovectors pAdHVO (comprising all Ad5 sequences except 
those nucleotides encompassing the El and E3 regions ) and pADHVE3 (comprising 
all Ad5 sequences except those nucleotides encompassing the El region), were each 
reconstructed so that they contained the modifications to the El region. This was 
accomplished by digesting the newly modified shuttle vector (MRKpdelEl shuttle) 
with Pacl and BstZl 101 and isolating the 2,734 bp fragment which corresponds to the 
adenoviros sequence. This fragment was co-transformed with DNA from either Clal 
linearized pAdHVO ^3- adenovector) or Clal linearized pAdHVE3 
0E3+adenovector) into E. coli B J5183 competent cells. At least two colonies from 
each transfonnation wore selected and grown in Terrific™ broth for 6-8 hours until 
turbidity was reached. DNA was extracted from each cell pellet and then transformed 
into E. coli XLl competent cells. One colony from each transformation was selected 
15 and grown for plasmid DNA purification. The plasmid was analyzed by restriction 
digestions to identify correct clones. The modified adenovectors were designated 
" 'FKpAdHVO (E3- plasmid) and MRKpAdHVE3 (E3+ plasmid). Virus from these 
new adenovectors (MRKHVO and MRKHVE3, respectively) as well as the old 
version of the adenovectors were generated in the PER.C6® cell lines to accommodate 
20 the following series of viral competition experiments. In addition, the multiple 
cloning site of the original shuttle vector contained Clal , Bamffl, Xho I, EcoRV, 
Hindm, Sal I. and Bgl 11 sites. This MCS was replaced with a new MCS containing 
Not I, Cla I, EcoRV and Asc I sites. This new MCS has been transfwred to the 
MRKpAdHVO and MRKpAdHVE3 pre-plasmids along with the modification made 
25 to the packaging region and pK gene. 

EXAMPLE 6 

/^pal ysis of the Effect of the Packaging S i^al Extension 
To study the effects of the modifications made to the El deletion re^on, the 
30 viruses obtained from the original backbone (pAdHVE3) and the new backbone 
(MRKpAdHVE3) were mixed together in equal MOI ratios (1:1 and 5:5) and 
passaged through several rounds; see Figure 5, Expt.#l. Both of the viruses in the 
experiment contained the E3 gene intact and did not contain a transgene. The only 
difference between the two viruses was within the region of the El deletion. 
35 Following the coinfection of the viruses at PI (passage 1), the mixtures were 

propagated through an additional 4 passages at which time the cells were harvested 
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and the virus extracted and purified by CsCl banding. The viral DNA was extracted 
and digested with HindUL and the digestion products were then radioactively labeled. 
For the controls, the respective pre-plasmids (pAdHVES ("OLD E3+"); 
MRKpAdHVES ("NEW E3+")) were also digested with Hvum (and Pad to remove 
5 the vector backbone) and subsequently labeled with [^^P]dATP. The radioactively 
labeled digestion products were subjected to gel electrophoresis and the gel was dried 
down onto Whatman paper before being exposed to autoradiographic fihn. Figure 6 
clearly shows that the new adenovirus which has the addition made to the packaging 
signal region has a growth advantage compared with the original adenovirus. In the 
10 experiments performed (at either ratio tested), only the digestion bands pertaining to 
die newly modified virus were present. The diagnostic band of size 3,206 (from the 
new virus) was clearly present. However, there was no evidence of the diagnostic 
band of size 2,737 bp expected from the original virus. 

15 EXAMPLE? 

Analysis of the Effect of the E3 Gene 
The second set of the vims competition study involved mixing equal MOI 
ratio (1:1) of the newly modified viruses, that obtained torn MRKpAdHVO and 
MRKpAdHVE3 (Figure 5, Expt. #2). In this set, both viruses had the new 

20 modifications made to die El deletion. The first virus (that from MRKpAdHVO) 
does not contain an E3 gene. The second virus (that from MElKpAdHVE3) does 
contain the E3 gene. Neither of the viruses contain a transgene. Following co- 
infection of the viruses, the mixtures were propagated through an additional 4 
passages at which time the cells were harvested and the total virus extracted and 

25 purified by CsCl banding. The viral DNA was extracted and digested with Hifum 
and the digestion products were then radioactively labeled. For the controls, the 
respective pre-plasmids MRKpAdHVO ("NEW E3-"); MRKpAdHVE3 ("NEW 
E3+") were also digested with Hindm (and Pad to remove the vector backbone) and 
then labeled with [33p]dATP. The radioactively labeled digestion products were 

30 subjected to gel electrophoresis and the gel was dried down onto Whatman paper 

before being exposed to autoradiographic fihn. Figure 6 shows the results of die viral 
DNA analysis of the E3+ virus and E3- virus mixing experiment. The diagnostic 
band coiresponding to die E3+ virus (5,665 bp) was present in greater amount 
compared with the diagnostic band of 3,010 bp coiresponding to the E3- virus. This 

35 indicates that the virus that contains the E3 gene is able to amplify more rapidly 
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compared with the vmis that does not contain an E3 gene. This increased 
amplification capacity has been confirmed by growth studies; see Table 4 below. 

EXAMPLES 

5 ConstructioTi of the new shuttle vector containing modifi ed gag transgene - 

"MRKpdelEl-CMVfno intronVFLgag-bGHpA" 
The modified plasmid pVlJnsCMV(no intron)^FLgag-bGHpA was digested 
with M^cl overnight and then digested with 5^1 for 2 hours at 50°C. The DNA was 
then treated with Mungbean nuclease for 30 mins at 30°C. The DNA mixture was 

10 desalted using the Qiaex n kit and then Klenow treated for 30 mins at 3TC to fuUy 
blunt the ends of the transgene fragment. The 2,559 bp transgene fragment was then 
gel purified. The modified shuttle vector (MRKpdelEl shuttle) was linearized by 
digestion with EcoRV. treated with calf intestinal phosphatase and the resulting 6,479 
bp fi^agment was then gel purified. The two purified fragments were then ligated 

15 together and several dozen clones were screened to check for insertion of the 

transgene within the shuttle vector. Diagnostic restriction digestion was perfonned to 
' 'ertify those clones carrying the transgene in the El parallel and El anti-parallel 
orientation. This strategy was followed to clone in the other gag transgenes in the 
MRKpdelEl shuttle vector. 

20 

EXAMPLE 9 
Construction of the MRK FG Adeno vectors 
The shuttle vector containing the HIV-1 gag transgene in the El parallel 
orientation, MRKpdelEl -CMV(no intron)-FLgag-bGHpA. was digested with Pacl. 

25 The reaction mixture was digested with BsfZlll. The 5,291 bp fragment was purified 
by gel extraction. The MRKpAdHVE3 plasmid was digested with Gal overnight at 
3TC and gel purified. About 100 ng of the 5,290 bp shuttle +transgene fragment and 
-100 ng of linearized MRKpAdHVE3 DNA were co-transformed into E. coli BJ5183 
chemicaDy competent cells. Several clones were selected and grown in 2 ml 

30 Terrific™ broth for 6-8 hours, until turbidity was reached. The total DNA from the 
cell pellet was purified using Qiagen alkaline lysis and phenol chloroform method. 
The DNA was precipitated with isopropanol and resuspended in 20 \\l dHiO. A 2 
aliquot of this DNA was transformed into E. coli XL-1 competent cells. A single 
colony from each separate transformation was selected and grown ovemigjit in 3 ml 

35 LB +100 |Xg/ml ampicillin. The DNA was isolated using Qiagen columns. A positive 
clone was identified by digestion with the restriction enzyme BstESl which cleaves 
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within the gag gene as well as the plasmid backbone. The pre-plasmid clone is 
designated MRKpAdHVE3+CMV(no intron)-FLgag-bGHpA and is 37,498 bp in size 
This strategy was followed to generate E3- and E3+ versions of each of the other gag 
ttansgene constractions in both El parallel and El anti-parallel versions. Figures 7A, 
5 7B and 7C show the various combinations of adenovectors constructed. 



EXAMPLE 10 
Plasmid Competition Studies 
A series of plasmid competiUon studies was earned out Briefly, the screening 

10 of the various combinations of new constructs was performed by mixing equal 

amounts of each of two competing plasmids. In the experiment shown in Figure 8A, 
plasmids containing the same transgene but in different orientations were mixed 
together to create a "competiUon" between the two plasmids. The aim was to look at 
the effects of transgene orientation. In the experiment shown in Figure 8B, plasmids 

15 containing different polyadenylation signals (but in the same orientation) were mixed 
together in equal amounts. The aim was to assess effects of polyA signals. FoUowing 
the initial transfection, the virus was passaged through ten rounds and the viral DNA 
.^yzed by radioactive restriction analysis. 

Analysis of the viral species from the plasmid mixing experiment (Figure 8 A) 

20 showed that adenovectors which had the transgene inserted in the El paraUel 

orientation amplified better and were able to out-compete the adenovirus which had 
the transgene inserted in the El anti-paraUel orientation. Viral DNA analysis of the 
mixtures at passage 3 and certainly at passage 6, showed a greater ratio of the vims 
carrying the transgene in the El parallel orientation compared with the El antiparallel 

25 version. By passage 10, the only viral species observed was the adenovector with the 
transgene in the El paraUel orientation for both transgenes tested (hCMV(no intron> 
FLgag-bGHpA and hCMV(no intron>FLgag-SPA). 

Analysis of the viral species from the plasmid mixing experiment #2 (Rgure 
8B) at passages 3 and 6 showed that the polyadenylation signals tested (bGHpA and 

30 SPA) did not have an effect on the growth of the virus. Even at passage 10 the two 
viral spedes in the mixture were still present in equal amounts. 
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EXAMPLE 11 

Vims peneratioTi of an enhanced adenoviral construc t - "MRK Ad5 HIV-lgag!; 
The results obtained from the competition study allowed us to make the 
following conclusions: (1) The packaging signal extension is beneficial; (2) Presence 
5 of E3 does enhance viral growth; (3) El parallel orientation is recommended; and (4) 
PolyA signals have no effect on the growth of the adenovirus. 

MRK Ad5 HTV-l gag exhibited the most desirable results. This construct 
contains the hCMV(no intron)-FLgag-bGHpA transgene inserted into the new E3+ 
adenovector backbone, MRKpAdHVES, in the El parallel orientation. We have 
10 designated this adenovector MRK Ad5 HIV-1 gag. This construct was prepared as 
oudined below: 

The pie-plasmid MRKpAdHVE3+CMV(no intron)-FLgag-bGHpA was 
digested was Pad to release the vector backbone and 3.3 jig was transfected by 
calcium phosphate method (Amersham Phamacia Biotech.) in a 6 cm dish containing 

15 PER.C6® cells at -60% confluence. Once CPE was reached (7-10 days), the culture 
was freeze/thawed three times and the cell debris pelleted. 1 ml of this cell lysate was 
to infect into a 6 cm dish containing PER.C6® cells at 80-90% confluence. Once 
CPE was reached, the culture was freeze/thawed three times and the cell debris 
pelleted. The cell lysate was then used to infect a 15 cm dish containing PER.C6® 

20 cells at 80-90% confluence. This infection procedure was continued and expanded at 
passage 6. The virus was then extracted from the cell pellet by CsCl method. Two 
bandings were performed (3-gradient CsCl followed by a continuous CsCl gradient). 
Following the second banding, the vims was dialyzed in A105 buffer. Viral DNA 
was extracted using pronase treatment followed by phenol chlorof onn. The viral 

25 DNA was then digested with Hindm and radioactively labeled with [33p]dATP. 
Following gel electrophoresis to separate the digestion products the gel was dried 
down on Whatman paper and then subjected to autoradiography. The digestion 
products were compared with the digestion products from the pre-plasmid (that had 
been digested with Pacl/Hindni prior to labeling). The expected sizes were 
30 observed, indicating that the virus had been successfully rescued. This strategy was 
used to rescue virus from each of the various adenovector plasmid constructs 
prepared. 
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EXAMPLE 12 
Stability Analyses 

To detennine whether the various adeiio vector constructs (e.g., MRK Ad5 
HTV-l gag) show genetic stability, the viruses were each passaged continually. The 
5 viral DNA was analyzed at passages 3, 6 and 10. Each virus maintained its conect 
genetic structure. In addition, the stability of the MRK Ad5 HIV-1 gag was analyzed 
under propagation conditions similar to that performed in large scale production. For 
this analysis, the transfcctions of MRK Ad5 HIV-1 gag as well as three other 
adenoviral vectors were repeated and the virus was purified at P3. The three other 

10 adenovectors were as follows: (1) that comprising hCMV(no intron)-Flgag with a 
bGHpA terminator in an E3- adenovector backbone; (2) that comprising hCMV(no 
intron)-Flgag with a SPA teraiination signal in an E3+ adenovector backbone, and 
that comprising a mCMV-Hgag with a bGHpA terminator in an E3+ adenovector 
backbone. All of the vectors have the transgene inserted in the El parallel orientation. 

15 Viral DNA was analyzed by radioactive restriction analysis to confirm that it was 
correct before being delivered to fermentation cell culture for continued passaging in 
serum-free media. At P5 each of the four vimses were purified and the viral DNA 
extracted for analysis by the restriction digestion and radiolabeUng procedure. This 
virus has subsequently been used in a series of studies (in vitro gag expression in COS 

20 cells, rodent study and rfiesus monkey study) as will be described below. The viruses 
-from P5 are^ shown in Figure 9. 

The passaging under serum-free conditions was continued for the 
MRKHVE3 (transgene-less, obtained from MRKpAdHVE3 pre-plasmid) and the 
MRKAdSmV-lgag (obtained from MRKpAdHVE3+CMV(no intron)-FLgag- 

25 bGHpA pre-plasmid) viruses. Figure 10 shows viral DNA analysis by radioactive 
restriction digestion at passage 11 for MRKHVE3, MRKAd5HIV-lgagE3-, and 
passage 11 and 12 for MRKAd5HIV-lgag. Aside from the fust lane which is the 
DNA niarker lane, the next three lanes are vims from the pre-plasmid controls 
(controls based on the original vims) - MRKpAdHVE3 (also referred to as 

30 "pMRKHVES"), MRKpAdHVE3+CMV(no intron)-FLgag-bGHpA, and 

pMRKAd5gag(E3-), respectively. As seen in Figure 10, each of Oie viral DNA 
samples show the expected bands with no extraneous bands showing. This signifies 
that there are no major variant adenovirus species present that can be detected by 
autoradiography. 

35 . Figure 1 1 shows the results of viral competition study between MRKHVE3 

and MRKAd5HIV-lgag. These vimses were mixed together at equal MOI (140 viral 
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particles each; 280 vp total) at passage 6 and continued to be passaged until PH. 
Aside from the first lane which is the DNA marker lane, the next two lanes are the 
pie-plasmid controls obtained from MRKpAdHVES and MRKpAdHVE3+CMV(no 
intron)-FLgag-bGHpA. The next two lanes are the viral DNA from the starting viral 
material at passage six. The last two lanes are the competition studies performed in 
dupUcate. The data in Rgure 11 shows the effect the gag transgene in culture. ^ 
Growth of a MRKAdSgag vims was compared with growth of a "transgene-less" 
MRKHVE3. These two viruses were infected at the same MOI (i.e. 140 vp each) at 
passage 6 and then passaged through to passage 11 and the viral pool was analyzed by 
radioactive restriction analysis. The data shows that one virus did not out compete the 
other. Therefore, the gag transgene did not show obvious signs of toxicity to the 
adenovirus. 

Analyas by Hindm ^gestion shows tiiat each virus specie is present in 
approximately equal amounts. As above, there does not appear to be signs of any 
extraneous bands. Figure 12 shows higher passage numbers for MRKAd5HIV-lgag 
grown under serum-containing conditions. The genome integrity again has been 
maintained and there is no evidence of rearrangements, even at the highest passage 
level (P21). 

Each of the four vectors shown in Figure 9 were analyzed for amplification 
capacity. Table 4 below shows the QPA analysis used in the estimation of viral 
amplification ratios at P4. The determination of the amplification ratio for the 
original HIV-1 gag construct is based on the cUnical lot at P12. It has been shown 
that ampUfication rates increases with higher passage number for the original virus. 
The reason for this observation is due to the emergeiice of variants which exhibit 
increased growth rates compared to the intact adenovector. With continued passaging 
of the original Ad gag vector, the level of variants increases and hence amplification 

rates increase also. 

The MRK Ad5 HTV-l gag virus has also been continually passaged under 
process conditions (i.e.. serum-free media). Viral DNA extracted from passages 11 
and 12 show no evidence of rearrangement 
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Table 4: 

Amplification Ratios Based on AEX and QPA Analysis of 
Virus Amplification fix)m Passage 3 to Passage 4. 



Ad gag construct 


Amplification Ratio 


MRKAdSgaR 


470 


HCMV-Flgag-bGHpA [E3-1 


115 


HCMV-Flgag-SPA fE3+l 


320 


mCMV-ELgag-bGHpA [E3+] 


420 


Original construct * 


40-50 



* This estimation is based on the clinical lot growth characteristics at Passage 12. 



EXAMPLE 13 

10 Analytical Evaluation of the enhanced Ad5 Constructs 

To study the effects of the transgene and the E3 gene on virus amplification, 
the enhanced adenoviral vector, MRK Ad5 HIV-1 gag, along with its transgene-less 
version (MRKpAdHVE3) and its E3- version (MRK Ad5 fflV-1 gag ES-), was 
studied for several passages under serum-free conditions. Table 5A shows the 

15 amplification ratios determined for passages P3 to P8 for MRK Ad5 HTV-l gag. 
Within a certain MOI range, it has been determined that the virus output is directly 
proportional to the virus input Therefore, the greater the number of virus particles 
per cell at infection, the greater the virus amount produced. Viral amplification ratios, 
on the other hand, are inversely proportional to the virus input. The lower the virus 

20 input, the greater the amplification ratio. 

Table 5B shows the amplification rates of the new E3+ vector backbone 
MRKpAdHVE3. It has a significantly lower rate of amplification compared with the 
gag transgene containing version. This may be contributed to the larger size MRK 
Ad5 HTV-l gag since it contains the transgene. This inclusion of the transgene brings 

25 the size of the adenovirus closer to the size of a wild type Ad5 virus. It is well known 
that adenoviruses amplify best when they are at close to their wild type genomic size. 
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Wild type Ad5 is 35,935 bp. The MRKpAdHVE3 is 32, 905 bp in length. The 
enhanced adenovector MRK Ad5 HIV-1 gag is 35,453bp (See Figure 14 for vector 
map; see also Figure 15A-X show the complete pre-adenoviral vector sequence, 
which includes an additional 2,021 bp of the vector backbone). 

5 Table 5C shows the amplification rates of the new E3- gag containing virus 

MRK Ad5 HTV-l gag E3-. Once again, this virus shows lower growth rate than the 
enhanced adenoviral vector. This may be attributed to the decreased sized of this 
virus (due to the E3 gene deletion) compared with wild type Ad5. The MRK Ad5 
HTV-l gag E3- virus is 32,810 bp in length. This can be compared with the wild type 

10 AdS which is 35,935 bp and MRK Ad5 HTV-l gag which is 35,453 bp in length. 
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Table 5A: Ampltfication ratios determined by AEX and QPA for MRKAdSgag over several 
continuous passaging in semnri free nnedia. Following P5, two replicate samples were taken 
1 and rep-2) and analyzed. 



MRKAdSgag repl 





Xv(1(/oo(hAi 
Intscttoo 


J), Vtabllty{%) 
Harvest 




NUmtw 


Ttar 

lO'vpAMcuhn 


lO'vpM 


QPA 


HbQo 
AEX3QPA 


AntiUicaflon 
Ratb 


AEX 
biiBmalContrd 


P4 


1^9. et% 


QL6a.eo% 


44 


46 


&7 


09 


1.72 


50 


470 

fMOUIJW 




PS 


1.38.93% 


a66, 47% 


40 


49 


07 


4.9 


1^ 


49 


170 


P6 


1.tM.B4% 


a6a.77% 


47 


48 


SuO 


t£ 


1.42 


41 


200 


P7 


ijbo.w% 


a88,ei% 


49.9 


90 


3l9 


U 


087 




SO 


P7 


1.09,87% 


a76.D9% 


50 


52 


02 


4.7 


1.70 


ai 


170 


P8 


1^, 94% 


036.64% 


47-6 


04 


8J> 


6.7 


1.10 


82 


310 


PB 


aeg, 95% 


039,73%" 


47J 


80 


4/1 


4.9 


1.03 


43 


179 


a.12 
2JM 


P10 


1.09.91% 


1.08.66% 


47JS 


69 


8.0 


Z6 


1.16 


28 


1W 


2.70 


f11 


1.19.08% 


096.63% 


47 


60 


16 


00 


1.15 


31 


110 


g.eo 

i70 
2.70 


«2 


aBB.91% 


aB5.63% 


47.6 


47 


&4 




1.20 


45 


200 


2.BS 
2.B0 


P19 




0.70.07% 


48 


48 


&0 


&e 


1.11 


S2 


210 


3.10 

3.te 




1M.92% 


000.67% 


40 


BS 


06 


4>« 






160 


3.28 
i27 


P19 


0.97,96% 


064,66% 


47 


47 


09 


7.1 






250 


3.12 
2.91 



Table 5B: Amplification ratios determined by AEX and QPA for MRKHVE3 over several 
continuous passaging in serum free media. MRKHVE3 is the new vector backbone which does 
?40T carry a transgene. 

MRKHVE3 





XV(t(f CBlbAl 


ni). MataU(y<%) 
Harvest 


Harvest Time 

IMXI. 


Cel Passaoa 
Number 


Tier 
lO^'vpMaiRure 


THar 
I0*vp«ea 


QPA 

i(f TaOboM 


RsltO 
AEXQPA 


AnjpBflcalloo 


AEX 


P4 


1.1087% 


1J2B.79% 


49 


54 


4.1 




1.70 


25 


300 




P5 


062.89% " 


1.10 77% 


47 


46 


4.S 


4.7 


1.24 


35 1 


170 




P6 


1.55.68% 


1.26.76% 


49.5 


GO 


1.2 


0.8 


ooe 


21 


30 




PB 


1.09»97% 


1.11.81% 


43 


52 


4J> 


06 


1.16 


34 


130 




P7 


1.17.91% 


1.22,91% 


47.6 


S4 


07 


12 


O50 


74 


110 




pa. 


0.9068% ~ 


1.41.63% 


48 


66 


2.1 


2.1 


0.47 


45 


76 


112 
2.84 


P8 


1.20 89% 


1.26.81% 


47A 


66 


08 


07 


0J20 


28 


25 


2.70 


P10 


099,62% 


1.65. 66% 


47 


60 


2.3 


2-3 


043 


53 


60 


a-M 

2.70 
2.70 


P11 


1.07.96% " 


1.26.63% 


46 


47 


2J 


2J 


041 


66 


00 


2.06 
2.60 


P12 


0.80 01%~ 


1.14,60% 




■49- 


BJ9 


7^4 


046 


123 


260 


3.16 
3.18 


P13 


1.90 85% 


1.14.66% 


453 


63 


u - 


3ja 






110 


026 
127 


P14 


007,96% 


1X0,68% 




4? 


04 


8.7 






350 


112 
2.91 


P18 


007,99% 


097.69% 


405 


40 


5J 


6,1 






218 


^78 

, 2=52 , 
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Table SC. Amplification ratios determined by AEX and QPA for MRKAd5gag(E3-) over several 
continuous passaging in serum free media. This construct is identical to the MRKAdSgag construct 
e3ccept that this version is DELETED of the E3 gene. 

5 

MRKAd5gag(E3') 





XvOOToSBaAi 


I).VI«taBy(%) 
HaivNi 


HsnsslTlTno 

Ma 


ColPassao" 


TllOf 
1D'*vp/Mou&uio 




QPA 

loTTaOi^ 


RbUo 


Antplflcailon 
Rano 


AEX 


P4 


1.82.77%" 


1.12.82% 


■ 


48 


3L0 


^2 


0JB2 


20 


100 
(M0I-12S) 




PS 


1.16.92% 


082.43% 


49 


49 


3.3 


Z3 


0.99 


34 


100 


P6 


1.71. 86% 


020.10% 


49 


SO 


4.7 


2.7 


1.70 


28 


100 


PA 


1.0SlV7% 


063.64% 


49;5 


52 


5.4 


5.0 


1.76 


ar 


480 


FT 


1.17.91% 


098. 72% 


47.50 


54 


7.1 


6.1 


0.67 


106 




P8 


aaa,88% 


0.77,48% 


48 


86 


3.1 


34 


0.68 


47 


116 


3.12 
2.B4 


PO 


1.20169% 


IM.72% 


48 


58 


1J 


1.5 


0.57 


92 


55- 


^70 
2.60 


P10 


099182% 


080.62% 


46.6 


60 


32 


32 


0.68 


47 


116 


2J0 
2.70 


P11 


1.07.96% 


088,70% 


48.5 


47 


M 


5.6 


0.68 


87 


20O 


2.88 
2.60 


P12 


080.91% 


0.87.59% 


SO 


48" 


&1 


6.4 


0.72 


71 


35 


3.18 
3.18 


P13 


1.36.95% 


0.91. S9% 


45.5 


S3 


7A 


3.8 






136 


028 
3.27 


P14 


097.96% 


081.74% 


48 


47 


08 


74) 






250 


3.12 
2^1 


Pifl 


0JB7,99% 


084.66% 


49 


49 


4S 


5J5 








2.78 
2.52 



EXAMPLE 14 
Gag Expression Analysis of the Novel Constructs 
In vitro gag analysis of the MRK Ad5 HTV-l gag and the original HTV-gag 
10 vectors (research and clinical lot) show comparable gag expression. The clinical lot 
shows only a slightly reduced gag expression level. The most noticeable difference is 
with the mCMV vector. This vector shows roughly 3 fold lower expression levels 
compared with the other vectors tested (which all contain hCMV promoters). The 
mCMV-FLgag with bGHpA assay was perfomied three times using different 
15 propagation and purification lots and it consistently exhibited weaker gag expression. 

EXAMPLE 15 

Evaluation of MRK Ad5 HIV-1 gag and Other gag-Containing 
. . Adenovectors in Balb/c Mice 
20 Cohorts of 10 balb/c mice were vaccinated intramuscularly with escalating 

doses of MRK Ad5 HIV-1 gag, and the research and clinical lots of original 
AdSHTV-lgag. Serum samples were collected 3 weeks post dose 1 and analyzed by 
anti-p24 sandwich EUSA. 
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Anti-p24 titers in mice that received MRK AdS HIV-l gag (107 and 109 
vp(viral particle) doses) were comparable (Figure 13) to those of the research lot of 
AdSHTV-l gag, for which much of the early rhesus data were generated on. These 
titers were also comparable when E3 is deleted (MRKAd5hCMVgagbGHpA(E3-)) or 
5 SPA is substituted for bGHpA terminator (MRKAdS hCMV-gag-SPA (E3+)) or 
murine CMV promoter is used in place of hCMV (MRKAdS mCMV-gag-bGHpA 
(E3+)) in the MRKAd5 backbone. 

The results shown in Table 7 indicate that the three other vectors ( in addition 
to the piefemed vector, MRK AdS HIV-l gag, are also capable of inducing strong 

10 anti-gag antibody responses in mice. Interestingly enough, while the mCMV-FLgag 
construct containing bGHpA and E3+ in an El parallel orientation showed lowest gag 
expression in the COS cell in vitro infection (Table 6) in comparison with the other 
vectors tested, it generated the greatest anti-gag antibody response this in vivo Balb/c 
study. Table 7 also shows a dose response in anti-gag antibody production in both the 

15 research and the clinical lot. As expected, the clinical lot shows reduced anti-gag 
antibody induction at each dosage level compared to the same dosage used for the 
research lot. 

Table 6: In vitro analysis for gag expression in COS cells by Elisa assay. 

20 



Viral Vectors* 


Jig gag/4.8xlOe5 COS/10e8 paits/48hr 


MRKAdSsag" 


1.40 


Clinical lot Ad5gag° 


1.28 


Research lot AdSgag*' 


1.32 


MCMVPL-gagbGHpA' 


0.42 



A26Qnin 

absorbance readings taken for viral particle determinations. 
^ MRKAdSgag was produced in serum free conditions and purified at P5. 
' Clinical lot# AdSgagFNOOOl 
25 ^ Research AdSFLgag lot# 6399 

• mCMVFL-gagbGHpA was produced in serum free conditions and purified at P5. 
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Table 7: mHlV020 Anti-p24 Ab Titers in Balhfe mice (n=10) vaccinated with various Adgag 
constructs and lots (3 week post dose1). 



Group 


Vaccine 


Dose 1 
(vd) 


GIUIT 


S£ upper 


SE lower 


1 
2 


"MRKAdSgag 

u 


10^7 
10^9 


25600 
409600 


5877 
94028 


4780 
76473 


3 
4 


hC^AV FL-gag bGHpA [E3-] -> 

a 


10^7 
10^9 


7352 
235253 


2077 
59767 


1620 

AftSam 


5 
6 


hCIVIVFL-gagSPAIE3fI-4 
■ 


10^7 
10A9 


12800 
310419 


9905 
99181 


236 
75165 


7 
8 


•^mCMV FL-gag bQHpA [E3+1 

m 


10^7 
10^ 


44572 
941014 


23504 
238068 


15389 
190636 


9 
10 


*hCMV FL-gag DGHpA [E3-] ^ 


10^7 
10^9 


3676 
117627 


934 
17491 


745 
15227 


i 1 

1 1 

12 
13 
14 


research lot hCMV intronA FL-gag bGHpA [E3-] <- 

m 
m 

u 


10^ 
10^7 
10^ 
10^9 


528 
14703 
58813 
204800 


262 
5274 
14942 
53232 


175 
3882 
11915 
42250 


15 
16 
17 
18 


clinical lot hCMVIntronA FL-gag bGHpA [£3-] <- 
• 

n 
M 


10^ 
10^7 
10^ 
10^ 


230 
4222 
19401 
89144 


82 
3405 
3939 
25187 


61 

1138 
3274 
19639 


19 


Naive 


none 


93 


7 





'.!^: Youil. Chen, Casimiro 
Vacdnallon: T. Toner, Q. Su 
Assay: M. Chen 

"The structure of MRKAdSgag Is: hCMVFL-gagbGHpA [E3+] -4 samsJa of ^«^KAcJ5gag 
used In this rodent study was used In the Rhesus mor*ey study aables 7 and B). 
••The same lot of mCMVFL-QagbGHpAlE3+l used in the in vitro study (T able 6) ws "sed here. 
This construct was designed by Volker Sandig. It contains a shorter fj^o" o' hCMV 
promoter than that used In the MRK constructs. The adenovector backbone is dent c^ to «» 
Sal backbone used in the original Adgag vector. Expression at 0e7 dose fronntha vector Is 7 
Sw towers^ the same dose ol the MRKAdSgag and 4 fold lower than the research tot. 



EXAMPLE 16 

Comparison of Humoral and CeUolar Responses Towards the Original Ad-gag 
Construct vdth the New MRK Ad5 fflV-l gag in Rhesus Monkeys 
5 Cohorts of 3 rhesus monkeys were vaccinated intramuscularly with MRK Ad5 

HIV-1 gag or the clinical AdSgag bulk at two doses, lO" vp and lO' vp. 
Immunizations were conducted at week 0, 4, and 25. Serum and PBMC samples were 
collected at selected time points. The serum sample were assayed for and-p24 Ab 
titers (using competitive based assay) and the PBMCs for antigen-specific IFN- 
10 gamma secretion following overnight stimulation with gag 20-mer peptide pool (via 
EUSpot assay). 

The results shown in Table 8 indicate comparable responses with respect to 
the generation of anti-gag antibodies. The firequencies of gag-specific T cells in 
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peripheral blood assummarized in Table 9 demonstrate a strong cellular immune 
response generated after a single dose with the new construct MRK Ad5 HIV-1 gag. 
The responses are also boostable with second dose of the same vector. The vector is 
also able to induce CD8+ T cell responses (as evident by remaining spot counts after 
5 CD4+ depletion of PBMCs) which are responsible for cytotoxic activity. 



Table 8 Anti-p24 antibody titers (in mMU/mL) in rhesus macaques immunized with 
jag-expiessing adenovectors (Protocol HIV203). 



vaccine 
MRKAdSaoo'.lO'^lI VD 
97N010 
97N116 
98X007 


Pre 

<10 
<10 
<10 


Wk4 

118 
62 
66- 


Wk8 

5528 
772 
3353 


Wkl2 

11523 
1447 
6156 


WK16 

7062 
1662 
6845 


- WK?P 

,21997 
2174 
3719 


ND 
NO 
ND 


Wk28 

51593 
20029 
24031 


MRKAdSQoa. 10*9vD 

97N120 
97N144 
98X008 


<10 
<10 
<10 


51 
18 
15 


204 
118 
444 


318 
274 
386 


366 
706 
996 


482 
888 
1072 


ND 
ND 
ND 


6550 
7136 
12851 


AdSgo^. ainlcd Lot. lO'^l 1 vd 

97X001 

97N146 


<10 
<10 


87 
72 


2579 


4718 


7174 


7250 


ND 


69226" 


98X009 

^^t^gqa ainlcd Lot. 10*9 vd 

97N020 


<10 
<10 


78 

<10 


3604 
4183 

143 


7380 
3946 

371 


7526 
3124 

390 


18906 
6956 

1821 


ND 
ND 

ND 


60283 
26226 

17177 


97X003 
98X012 

"^KAcSaxiOnCM^. bSHcA. E3+^ 


<10 
<10 


<10 
81 


39 
342 


93 
717 


156 
956 


596 
1558 


ND 
ND 


2053 
11861 


"airtnd AcBocp vector OiCMV^nlron A bGMcA. lor#FNOaoi 
MD^nerfdBtanilned 1 ) | 
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Table 9. Number of gag-specific T cells per million peripheral blood mononuclear 
ceUs (PBMCs) in rhesus monkeys immunized with gag-expressing adenovectors. 
Also included are those frequencies in PBMCs depleted of CD4* T ceUs. 



erpa Voodndton MonksiflD 


Mtedlo" 




Medio, _ 


NY, 

GoqH 


T=1 


GqqH 




WK—- 
GoqH 




«9ftlL. 


-_LE2a. 

Meda 


GoqH 


1 M^KAcfiOQD WhDlD 

\fy^\MQ 97N010CCE4-) 
97NU6 
97N116CC04-) 
98X007 


6 
4 
1 

11 
10 

20 


69 
38 
S86 
676 
STB 
965 


0 

1 

0 


395 
609 
1304 


0 
3 
0 
0 
3 
0 


1058 
993 
S34 
593 
2193 
2675 


0 
4 

1 


1174 
395 
2118 


3 
0 

1 

0 
3 
0 


776 

76 

261 

164 

1588 

1666 

"l06" 


4 

0 
0 
0 
0 
0 


1074 
694 
408 
666 
2113 
1278 

219 


97N144 
97N144CCD4-) 

98X008 
98XC)08(CD4.) 


5 
1 \ 

3 
6 
4 
14 


275 
170 
236 
148 
368 


1 

6 
1 


249 
438 
1090 


4 
0 
1 
0 
3 
0 


141 
85 
318 
285 
891 
1175 


4 
3 
4 


119 
256 
673 


9 
0 
1 

ND 
3 
0 


75 
98 
ND 
473 
391 


4 
1 
5 
0 
5 
4 


219 
373 
625 
735 
848 


3 AcfiOOOCflrtcdloit 97X001 

10*11 VP 97X001<qD4-) 
97N146 
97N146CCD4-) 

98X009 
OBX009(q>4>) 


0 
10 
3 
6 
0 
0 


261 
283 
ISO 
133 
93 
73 


t 
1 

a 


485 
485 
339 


0 
3 
0 
0 
3 
0 


817 
996 
339 
370 
559 
333 


0 
1 

0 


1220D 
1272 
696 


1 

0 
3 
0 

1 
0 


694 
1010 
1238 
654 
384 
225 


0 
0 
3 
0 
0 
0 


1856 
1123 
1785 
971 
1748 
644 


1(K9vp 97N020CCD^j 
97X003 

9n(0Q3eCO4-) 
98X012 

9BX012(q>4-) 


3 
10 
4 
9 
5 
11 


30 
29 
68 
40 
OS 
70 


1 
a 

3 


101 
134 
54 


0 
0 
0 
0 

1 

0 


66 
16 
18 
6 
34 
11 


0 

1 

0 


36 
38 
18 


0 
0 
4 
0 
0 
0 


26 
1 

38 
4 

20 
8 


0 
0 
6 
0 
1 
0 


41 
16 
81 
19 
121 
41 


5 NrfvB 96R041 
0S3F 


6 
14 


8 
18 


1 
5 


1 

16. 


0 
20 


0 
14 


0 
19 


0 
15 


0 
10 


0 
16 


1 

24 


0 
9 


RcBecla^ ara 4x10^ cr 2x10^5 oBlb per <*eM (dapendi 


ngcnspc 


density) 



The adenovectors described herein and, particularly, MRK. AdS HIV-1 gag, 
represent very promising HIV-gag adenovectors with respect to their enhanced 
growth characteristics in both senrai and, more importantly, in serum-free media 

10 conditions. In comparison with the current HIV-1 gag adenovector construct. MRK 
AdS KV-l gag shows a 5-10 fold increased amplification rate. We have shown that 
it is genetically stable at passage 21. This construct is able to generate significant 
ceUular unmune responses in vivo even at a relatively low dose of 10'^9 vp. The 
potency of tiie MRKAdSgag construct is comparable to, if not better than the original 

15 HTV-lgag vector as shown in this rhesus monkey study. 

EXAMPIJE17 

CODON OPTIMIZED HIV-1 POL AND CODON OPTIMZED 
HIV-1 POL MODIFICATIONS 
20 The open reading frames for the various synthetic pol genes disclosed herein 

comprise coding sequences for the reverse transcriptase (or RT which consists of a 
polymerase and RNase H activity) and integrase (IN). Tlie protein sequence is based 
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on that of ILeb2r, a clonal isolate of niB; this sequence has been shown to be closest 
to the consensus clade B sequence with only 16 nonidentical residues out of 848 
(Koiber, et al., 1998, Human retroviruses and AIDS, Los Alamos National 
Laboratory, Los Alamos, New Mexico). The skilled artisan will understand after 
5 review of this specification that any available HIV-1 or HIV-2 strain pix)vides a 
potential template for the generation of HIV pol DNA vaccine constructs disclosed 
herein. It is further noted that the protease gene is excluded from the DNA vaccine 
constructs of the present invention to insure safety from any residual protease activity 
in spite of mutational inactivation. The design of the gene sequences for both wild- 

10 type (wt-pol) and inactivated pol (lA-pol) incorporates the use of human preferred 
("humanized") codons for each amino acid residue in the sequence in order to 
maximize in vivo mammalian expression (Lathe, 1985, J. Mol Biol 183:1-12). As 
can-be discerned by inspecting the codon usage in SEQ ID NOs: 1, 3, 5 and 7, the 
following codon usage for mammalian optimization is prefeixed: Met (ATG), Gly 

15 (GGC), Lys (AAG), Tip (TGG), Ser (TCC), Arg (AGG), Val (GTG), Pro (CCC), Thr 
(ACQ, Glu (GAG); Leu (CTG), His (CAC). lie (ATC), Asn (AAC), Cys (TGC), Ala 
(GCC), Gin (GAG). Phe (nC) and Tyr (TAG). For an additional discussion relating 
-J mammalian (human) codon optimization, see WO 97/31115 (PCTAJS97/02294), 
which, as noted elsewhere in this specification, is hereby incorporated by reference. It 

20 is intended that the skilled artisan may use alternative versions of codon optimization 
or may omit this step when generating HIV pol vaccine constracts within the scope of 
the present invention. Therefore^ the present invention also relates to non-codon 
optimized versions of DNA molecules and associated recombinant adenoviral HTV 
vaccines which encode the various wild type and modified forms of the HIV Pol 

25 protein disclosed herein. However, codon optimization of these constructs is a 
preferred embodiment of this invention. 

A particular embodiment of this portion of the invention comprisies codon 
optimized nucleotide sequences which encode wt-pol DNA constructs (herein, "wt- 
pol" or "wt-pol (codon optimized))" wherein DNA sequences encoding the protease 

30 (PR) activity are deleted, leaving codon optimized "wild type" sequences which 

encode RT (reverse transcriptase and RNase H activity) and IN integrase activity. A 
DNA molecule which encodes this protein is disclosed herein as SEQ ID NO:l, the 
open reading frame being contained from an initiating Met residue at nucleotides 10- 
12 to a termination codon from nucleotides 2560-2562. SEQ ID N0:1 is as follows: 

35 AGATCTACCA TGGCCCCCAT CTCCCCCATT GAGACTGTGC CTGTGAAGCT GAAGCCTGGC 
ATGGATGGCC CCAAGGTGAA GCAGTGGCCC CTGACTGAGG AGAAGATCAA GGCCCTGGTG 
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GAAATCTGCA CTGAGATGGA GAAGGAGGGC AAAATCTCCA ASATTGGCCC CGAGAACCCC 
TACAACACCC CTGTGTTTGC CATCAAGAAG AAGGACTCCA CCAAGTGGAG GAAGCTGGTG 
GACTTCAGGG AGCTGAACAA GAGGACCCAG SACTTCTGG6 AGGTGCAGCT GGGCATCCCC 
CACCCCGCTG GCCTGAAGAA GAAGAAGTCT GTGACTGTGC TGGATGTGGG GGATGCCTAC 
TTCTCTGTGC CCCTGGATGA GGACTTCAGG AAGTACACTG CCTTCACCAT CCCCTCCATC 
AACAATGAGA CCCCTGGCAT CAGGTACCAG TACAATGTGC TGCCCCAGGG CTGGAAGGGC 
TCCCCT6CCA TCTTCCAGTC CTCCATGACC AAGATCCTGG AGCCCTTCAG GAAGCAGAAC 
CCTGACATTG TGATCTACCA GTACATGGAT GACCTGTAT6 TGGGCTCTGA CCTGGAGATT 
GGGCAGCACA GGACCAAGAT TGAGGAGCTG AGGCAGCACC TGCTGAGGT6 GGGCCTGACC 
ACCCCTCACA AGAAGCACCA GAAGGAGCCC CCCTTCCTGT GGATGGGCTA TGAGCTGCAC 
CCCGACAAGT G6ACTGTGCA GCCCATTGTG CTGCCTGAGA AGGACTCCTG GACTGTGAAT 
6ACATCCAGA AGCTGGTGG6 CAAGCTGAAC T6GGCCTCCC AAATCTACCC TGGCATCAAG 
GTGAGGCAGC TGTGCAAGCT GCT6AGGGGC ACCAAGGCCC TGACTGAGGT GATCCCCCTG 
ACTGAGGAGG CTCAGCTGGA GCTGGCTGA6 AACAGGGAGA TCCTGAAG6A GCCTGTGCAT 
GGGGTGTACT ATGACCCCTC CAAGGACCTC ATTGCTGAGA TCCAGAAGCA GGGCCAGGGC 
CAGTGGACCT ACCAAATCTA CCAGGAGCCC TTCAAGAACC TGAAGACTG6 CAAGTATGCC 
>GGATGAGGG GGGCCCACAC CAATGATGTG AAGCAGCTGA CTGAGGCTGT GCAGAAGATC 
ACCACTGAGT CCATTGTGAT CTGGGGCAAG ACCCCCAAGT TCAAGCTGCC CATCCA6AAG 
GAGACCTGGG AGACCTGGTG GACTGAGTAC TGGCAGGCCA CCTGGATCCC TGAGTGGGAG 
TTTGTGAACA CCCCCCCCCT GGTGAAGCTG TGGTACCAGC TGGAGAAGGA GCCCATTGTG 
GGGGCTGAGA CCTTCTATGT GGATGGGGCT GCCAACAGGG AGACCAAGCT GGGCAAGGCT 
GGCTATGTGA CCAACAGGGG CAGGCAGAAG GTGGTGACCC TGACTGACAC CACCAACCAG 
AA6ACTGAGC TCCAGGCCAT CTACCTGGCC CTCCAG6ACT CTG6CCTGGA GGTGAACATT 
CTGACTGACT CCCAGTATGC CCTGGGCATC ATCCAGGCCC AGCCTGATCA GTCTGAGTCT 
GAGCTGGT6A ACCAGATCAT TGAGCAGCTG ATCAAGAA6G AGAAGGTGTA CCTGGCCTGG 
GTCCCTGCCC ACAAGGGCAT TGGGGGCAAT GAGCAGGTGG ACAAGCTGGT GTCTGCTGGC 
ATCAGGAAGG TGCTGTTCCT GGATGGCATT GACAAGGCCC AGGATGAGCA TGAGAAGTAC 
CACTCCAACT GGAGGGCTAT GGCCTCTGAC TTCAACCTGC CCCCTGTGGT GGCTAAGGAG 
ATTGTGGCCT CCTGTGACAA GTGCCAGCTG AAGGGGGAGG CCATGCATGG GCAGGTGGAC 
TGCTCCCCTG GCATCTGGCA GCTGGACTGC ACCCACCTGG AGGGCAAGGT GATCCTGGTG 
■GCTGTGCATG TGCCCTGCGG CTACATTGAG GCTGAGGTGA TCCCTGCTGA GACAGGCCAG 
GAGACTGCCT ACTTCCTGCT GAAGCTGGCT GGCAGGTGGC CTGTGAAGAC CATCCACACT 
GACAATGGCT CCAACTTCAC TGGGGCCACA 6TGAGGGCT6 CCTGCTGGTG GGCTGGCATC 
AAGCAGGAGT TTGGCATCCC CTACAACCCC CAGTCCCAG6 GGGTGGTGGA GTCCATGAAC 
AAGGAGCTGA AGAAGATCAT TGGGCAGGTG AGGGACCAGG CTGAGCACCT GAAGACAfiCT 
GTGCA6ATGG CTGTGTTCAT CCACAACTTC AAGAG6AAGG GGGGCATCGG GGGCTACTCC 
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GCTGGGGAGA GGATTGTGGA CATCATTGCC ACAGACATCC AGACCAAGGA GCTCCAGAAG 
CAGATCACCA AGATCCAGAA CTTCAGGGTG TACTACAGGG ACTCCAGGAA CCCCCTGTGG 
AAGGGCCCTG CCAAGCTGCT GTGGAAGGGG. GAGGGGGCTG TGGTGATCCA GGACAACTCT 
GACATCAAGG TGGTGCCCAG GAGGAAGGCC AAGATCATCA GGGACTATGG CAAGCAGATG 
5 GCTGGGGATG ACTGTGTGGC CTCCAGGCAG GATGAGGACT AAAGCCCGGG CAGATCT (SEQ 
ID N0:1) . 

The open reading frame of the wDd type pol construct disclosed as SEQ ID 
N0:1 contains 850 amino acids, disclosed herein as SEQ ID N0:2, as follows: 
Met Ala Pro lie Ser Pro He Glu Thr Val Pro Val Lys Leu Lys Pro 

10 Gly Met Asp Gly Pro Lys Val Lys Gin Trp Pro Leu Thr Glu Glu Lys 
He Lys Ala Leu Val Glu He Cys Thr Glu Met Glu Lys Glu Gly Lys 
He Ser Lys He Gly Pro Glu Asn Pro Tyr Asn Thr Pro Val Phe Ala 
He Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys Leu Val Asp Phe Arg 
Glu Leu Asn Lys Arg Thr Gin Asp Phe Trp Glu Val Gin Leu Gly He 

15 Pro His Pro Ala Gly Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp 
Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp Glu Asp Phe Arg Lys 
Tyr Thr Ala Phe Thr He Pro Ser He Asn Asn Glu Thr Pro Gly He 
Tyr Gin Tyr Asn Val Leu Pro Glp Gly Trp Lys Gly Ser Pro Ala 
He Phe Gin Ser Ser Met Thr Lys He Leu Glu Pro Phe Arg Lys Gin 

20 Asn Pro Asp He Val He Tyr Gin Tyr Met Asp Asp Leu Tyr Val Gly 
Ser Asp Leu Glu He Gly Gin His Arg Thr Lys He Glu Glu Leu Arg 
Gin His Leu Leu Arg Trp Gly Leu Thr Thr Pro Asp Lys Lys His Gin 
Lys Glu Pro Pro Phe Leu Trp Met Gly Tyr Glu Leu His Pro Asp Lys 
Trp Thr Val Gin Pro He Val Leu Pro Glu Lys Asp Ser Trp Thr Val 

25 Asn Asp He Gin Lys Leu Val Gly Lys Leu Asn Trp Ala Ser Gin He 
Tyr Pro Gly He Lys Val Arg Gin Leu Cys Lys Leu Leu Arg Gly Thr 
Lys Ala Leu Thr Glu Val He Pro Leu Thr Glu Glu Ala Glu Leu Glu 
Leu Ala Glu Asn Arg Glu He Leu Lys Glu Pro Val His Gly Val Tyr 
Tyr Asp Pro Ser Lys Asp Leu He Ala Glu He Gin Lys Gin Gly Gin 

30 Gly Gin Trp Thr Tyr Gin He Tyr Gin Glu Pro Phe Lys Asn Leu Lys 
Thr Gly Lys Tyr Ala Arg Met Arg Gly Ala His Thr Asn Asp Val Lys 
Gin Leu Thr Glu Ala Val Gin Lys He Thr Thr Glu Ser He Val He 
Trp Gly Lys Thr Pro Lys Phe Lys Leu Pro He Gin Lys Glu Thr Trp 
Glu Thr Trp Trp Thr Glu Tyr Trp Gin Ala Thr Trp He Pro Glu Trp 

35 Glu Phe Val Asn Thr Pro Pro Leu Val Lys Leu Trp Tyr Gin Leu Glu 
Lys Glu Pro He Val Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala 
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Asn Arg Glu Thr Lys Leu Gly Lys Ala Gly Tyr Val Thr Asn Arg Gly 
Arg Gin Lys Val Val Thr Leu Thr Asp Thr Thr Asn Gin Lys Thr Glu 
Leu Gin Ala lie Tyr Leu Ala Leu Gin Asp Ser Gly Leu Glu Val Asn 
He val Thr Asp Ser Gin Tyr Ala Leu Gly He He Gin Ala Gin Pro 
5 Asp Gin Ser Glu Ser Glu Leu Val Asn Gin He He Glu Gin Leu He 
Lys Lys Glu Lys Val Tyr Leu Ala Trp Val Pro Ala His Lys Gly He 
Gly Gly Asn Glu Gin Val Asp Lys Leu Val Ser Ala Gly He Arg Lys 
Val Leu Phe Leu Asp Gly He Asp Lys Ala Gin Asp Glu His Glu Lys 
Tyr His Ser Asn Trp Arg Ala Met Ala Ser Asp Phe Asn Leu Pro Pro 
10 val val Ala Lys Glu He Val Ala Ser Cys Asp Lys Cys Gin Leu Lys 
Gly Glu Ala Met His Gly Gin Val Asp Cys Ser Pro Gly He Trp Gin 
Leu Asp Cys Thr His Leu Glu Gly Lys Val He Leu Val Ala Val His 
val Ala Ser Gly Tyr He Glu Ala Glu Val He Pro Ala Glu Thr Gly 
Gin Glu Thr Ala Tyr Phe Leu Leu Lys Leu Ala Gly Arg Trp Pro Val 
15 Lys Thr He His Thr Asp Asn Gly Ser Asn Phe Thr Gly Ala Thr Val 
Arg Ala Ala Cys Trp Trp Ala Gly He Lys Gin Glu Phe Gly He Pro 
Tyr Asn Pro Gin Ser Gin Gly Val Val Glu Ser Met Asn Lys Glu Leu 
uyB Lys He He Gly Gin Val Arg Asp Gin Ala Glu His Leu Lys Thr 
Ala val Gin Met Ala Val Phe He His Asn Phe Lys Arg Lys Gly Gly 
20 He Gly Gly Tyr Ser Ala Gly Glu Arg He Val Asp He He Ala Thr 
Asp He Gin Thr Lys Glu Leu Gin Lys Gin He T?hr Lys He Gin Asn 
Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asn Pro Leu Trp Lys Gly Pro 
Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val He Gin Asp Asn 
Ser Asp He Lys Val Val Pro Arg Arg Lys Ala Lys He He Arg Asp 
25 Tyr Gly Lys Gin Met Ala Gly Asp Asp Cys Val Ala Ser Arg Gin Asp 
Glu Asp (SEQ ID N0:2) . 

The present invention especiaUy relates to an adenoviral vector vaccine which 
comprises a codon optimized fflV-l DNA pol construct wherdn, in addition to 
deletion of the portion of the wild type sequence encoding the protease activity, a 

30 combination of active site residue mutations are introduced which are deleterious to 
HIV-1 pol (RT-RH-IN) activity of the expressed protein. Therefore, the present 
invention preferably relates to an adenoviral HIV-1 DNA pol-based vaccine wherein 
the construct is devoid of DNA sequences encoding any PR activity, as well as 
containing a mutation(s) which at least partially, and preferably substantially, 

35 aboUshes RT, RNase and/or IN activity. One type of HIV^l pol mutant which is part 
and parcel of an adenoviral vector vaccine may include but is not limited to a mutated 
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DNA molecule comprising at least one nucleotide substitution which results in a point 
mutation which effectively alters an active site within the RT, RNase and/or IN 
regions of the expressed protein, resulting in at least substantially decreased 
enzymatic activity for the RT, RNase H and/or IN functions of HIV-1 Pol. In a 
5 preferred embodiment of this portion of the invention, a HIV-1 DNA pol constract 
contains a mutation or mutations within the Pol coding region which effectively 
aboUshes RT, RNase H and IN activity. An especially preferable HIV-1 DNA pol 
construct in a DNA molecule which contains at least one point mutation which alters 
the active site of the RT. RNase H and IN domains of Pol, such that each activity is at 

10 least substantially abolished. Such a HIV-1 Pol mutant wiU most likely comprise at 
least one point mutation in or around each catalytic domain responsible for RT, 
RNase H and EST activity, respectfully. To this end, an especiaUy prefened HIV-1 
DNA pol construct is exemplified herein and contains nine codon substitution 
mutations which results in an inactivated Pol protein (lA Pol: SEQ ID N0:4, Figure 

15 17A-C) which has no PR, RT, RNase or IN activity, wherein three such point 

mutations reside within each of the RT, RNase and IN catalytic domains. Therefore, 
an especiaUy preferred exemplification is an adenoviral vaccine which comprises, in 
an appropriate fashion, a DNA molecule which encodes lA-pol. which contains all 
nine mutations as shown below in Table 1. An additional preferred amino acid 

20 residue for substitution is Asp551. localized witiiin tiie RNase domain of Pol. Any 
combination of tiie mutations disclosed herein may suitable and tiierefore may be 
utilized as an lA-Pol-based vaccine of tiie present invention. While addition and 
deletion mutations are contemplated and witiiin flie scope of tiie invention, tiie 
preferred mutation is a point mutation resulting in a substitution of tiie wild type 

25 amino acid with an altemative amino acid residue. 

Table 1 



30 
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aa residue 


mutant aa 


enzvme function 


Asp 


112 


Ala 


RT 


Asp 


187 


Ala 


RT 


Asp 


188 


Ala 


RT 


Asp ■ 


445 


Ala 


RNase H 


Glu 


480 


Ala 


RNaseH 


Asp 


500 


Ala 


RNase H 


Asp 


626 


Ala 


IN 


Asp 


678 


Ala 


IN 


Glu 


714 


Ala 


IN 
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It is prefeired that point mutations be incorporated into the lApol mutant adenoviral 
vaccines of the present invention so as to lessen the possibility of altering epitopes in 
and around the active site(s) of HTV-l Pol. 

To this end, SEQ ID N0:3 discloses the nucleotide sequence which codes for 
a codon optimized pol in addition to the nine mutations shown in Table 1, disclosed as 
follows, and referred to herein as "lApol": 

AGATCTACCA TGGCCCCCAT CTCCCCCATT GAGACTGTGC CTGTGAAGCT GAAGCCTGGC 
ATGGATGGCC CCAAGGTGAA GCAGTGGCCC CTGACTGAGG AGAAGATCAA GGCCCTGGTG 
GAAATCTGCA CTGAGATGGA GAAGGAGGGC AAAATCTCCA AGATTGGCCC CGAGAACCCC 
TACAACACCC CTGTGTTTGC CATCAAGAAG AAGGACTCCA CCAAGTGGAG GAAGCTGGTG 
GACTTCAGGG AGCTGAACAA GAGGACCCAG GACTTCTGGG AGGTGCAGCT GGGCATCCCC 
CACCCCGCTG GCCTGAAGAA GAAGAAGTCT GTGACTGTGC TGGCTGTGGG GGATGCCTAC 
TTCTCTGTGC CCCTGGATGA 6GACTTCAGG AAGTACACTG CCTTGACCAT CCCCTCCATC 
AACAATGAGA CCCCTGGCAT CAG6TACCAG TACAATGTGC TGCCCCAGGG CTGGAA66GC 
TCCCCTGCCA TCTTCCA6TC CTCCATGACC AAGATCCTGG AGCCCTTCAG GAAGCAGAAC 
CCTGACATTG TGATCTACCA 6TACATGGCT GCCCTGTATG TGGGCTCTGA CCTGGAGATT 
'^GT^CAGCACA GGACCAAGAT TGAGGAGCTG AGGCAGCACC TGCTGAGGTG GGGCCTGACC 
ACCCCTGACA AGAAGCACCA GAAGGAGCCC CCCTTCCTGT GGATGGGCTA TGAGCTGCAC 
CCCGACAAGT GGACTGTGCA GCCCATTGTG CTGCCTGAGA AGGACTCCTG GACTGTGAAT 
GACATCCAGA AGCTGGTGGG CAAGCTGAAC TGGGCCTCCC AAATCTACCC TGGCATCAAG 
GTGAGGCAGC TGTGCAAGCT GCTGAGGGGC ACCAAGGCCC TGACTGA66T GATCCCCCTG 
ACTGAGGA6G CTGAGCTGGA GCTGGCTGAG AACAGGGAGA TCCTGAAG6A GCCTGTGCAT 
GGGGTGTACT ATGACCCCTC CAAGGACCTG ATTGCT6AGA TCCAGAAGCA GGGCCAGGGC 
CAGTGGACCT ACCAAATCTA CCAGGAGCCC TTCAAGAACC TGAAGACTGG CAAGTATGCC 
AGGATGAGGG G6GCCCACAC CAATGATGTG AAGCAGCTGA CTGAGGCTGT GCAGAAGATC 
ACCACTGAGT CCATTGTGAT CTGGGGCAAG ACCCCCAAGT TCAAGCTGCC CATCCAGAAG 
GAGACCTGGG AGACCTGGTG GACTGAGTAC TGGCAGGCCA CCTGGATCCC TGAGTGGGAG 
TTTGTGAACA CCCCCCCCCT GGTGAAGCTG TGGTACCAGC TGGAGAAGGA GCCCATTGTG 
GGGGCTGAGA CCTTCTATGT GGCTGGGGCT GCCAACAGGG AGACCAAGCT GGGCAAGGCT 
GGCTATGTGA CCAACAGGGG CAGGCAGAAG GTGGTGACCC TGACTGACAC CACCAACCAG 
AAGACTGCCC TCCAGGCCAT- CTACCTGGCC CTCCAGGACT CTGGCCTGGA GGTGAACATT 
GTGACTGCCT CCCAGTATGC CCTGGGCATC ATCCAGGCCC AGCCTGATCA GTCTGAGTCT 
GAGCTGGTGA ACCAGATCAT TGAGCAGCTG ATCAAGAAGG AGAAGGTGTA CCTGGCCTGG 
GTGCCTGCCC ACAAGGGCAT TGGGGGCAAT GAGCAGGTGG ACAAGCTGGT GTCTGCTGGC 
ATCAGGAAGG T6CTGTTCCT GGAT6GCATT GACAAGGCCC AGGATGAGCA T6AGAAGTAC 
CACTCCAACT GGAGGGCTAT GGCCTCTGAC TTCAACCTGC CCCCTGTGGT GGCTAAGGAG 
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ATTGTGGCCT CCTGTGACAA GTGCCAGCTG AAGGGGGAGG CCAOXSCATGG GCAGGTGGAC 
TGCTCCCCTG GCATCTGGCA GCTGGCCTGC ACCCACCTGG AGGGCAAGGT GATCCOGGTG 
GCTGTGCATG TG6CCTCCGG CTACATTGAG GCTGAGGTGA TCCCTGCTGA GACAGGCCAG 
GAGACTGCCT ACTTCCTGCT GAAGCTGGCT GGCAGGTGGC CTGTGAAGAC CATCCACACT 
5 GCCAATGGCT CCAACTTCAC TGGGGCCACA GTGAGGGCTG CCTGCTGGTG GGCTGGCATC 
AAGCAGGAGT TTGGCATCCC CTACAACCCC CAGTCCCAGG GGGTGGTGGC CTCCATGAAC 
AAGGAGCTGA AGAAGATCAT TGGGCAGGTG AGGGACCAGG CTGAGCACCT GAAGACAGCT 
GTGCAGATGG CTGTGTTCAT CCACAACTTC AAGAGGAAGG GGGGCATCGG GGGCTACTCC 
GCTGGGGAGA GGATTGTGGA CATCATTGCC ACAGACATCC AGACCAAGGA GCTCCAGAAG 
10 CAGATCACCA AGATCCAGAA CTTCAGGGTG TACTACAG6G ACTCCAGGAA CCCCCTGTGG 
AAGGGCCCTG CCAAGCTGCT GTG6AAGGGG GAGGGGGCTG TGGTGATCCA GGACAACTCT 
GACATCAAGG TGGTGCCCAG GAGGAAGGCC AAGATCATCA GGGACTATGG CAAGCAGATG 
GCTGGG6ATG ACTGTGTGGC CTCCAGGCAG GATGAGGACT AAAGCCCGGG CAGATCT (SEQ ID 
N0:3) . 

15 In order to produce the lA-pol-based adenoviral vaccines of the present 

invention, inactivation of the enzymatic functions was achieved by replacing a total of 
nine active site residues from the enzyme subunits with alanine side^hains. As 
snown in Table 1, all residues that comprise the catalytic triad of the polymerase, 
namely Aspll2, Aspl87, and Aspl88, were substituted with alanine (Ala) residues 

20 (Larder, et al, Nature 1987, 327: 716-717; Larder, et al., 1989, Proc. Natl Acad. ScL 
1989; 86: 4803-4807). Three additional mutations were introduced at Asp445, 
Glu480 and AspSOO to abolish RNase H activity (Asp551 was left unchanged in this 
lA Pol constmct), with each residue being substituted for an Ala residue, respectively 
(Davies, et al., 1991, Science 252:, 88-95; Schatz, et al., 1989, FEES Utt. 257: 311- 

25 314; Mizrahi, et al., 1990, Nucl. Acids. Res, 18: pp. 5359-5353). HIV pol integrase 
function was abolished through three mutations at Asp626, Asp678 and GIu714. 
Again, each of these residues has been substituted with an Ala residue (Wiskerchen, 
et al., 1995, J. Virol. 69: 376-386; Leavitt, et al., 1993, J. Biol. Chem. 268: 2113- 
21 19), Amino acid residue Pro3 of SEQ ID N0:4 marks the start of the RT gene. 

30 The complete amino acid sequence of L\-Pol is disclosed herein as SEQ ID NO:4 and 
Figure 17A-C, as follows: 

Met Ala Pro lie Ser Pro He Glu Thr Val Pro Val Lys Leu Lys Pro 
Gly Met Asp Gly Pro Lys Val. Lys Gin Trp Pro Leu Thr Glu Glu Lys 
He Lys Ala Leu Val Glu He Cys Thr Glu Met Glu Lys Glu Gly Lys 
35 He Ser Lys He Gly Pro Glu Asn Pro Tyr Asn Thr Pro Val Phe Ala 
He Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys Leu Val Asp Phe Arg 
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.Glu Leu Asn Lys Arg Thr Gin Asp Phe Trp Glu Val Gin Leu Gly lie 
Pro His Pro Ala Gly Leu Lys Lys Lys Lys Ser Val Thr val Leu Ala 
Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp Glu Asp Phe Arg Lys 
Tyr Thr Ala Phe Thr He Pro Ser He Asn Asn Glu Thr Pro Gly He 
Arg Tyr Gin Tyr Asn Val Leu Pro Gin Gly Trp Lys Gly Ser Pro Ala 
He Phe Gin Ser Ser Met Thr Lys He Leu Glu Pro Phe Arg Lys Gin 
Asn Pro Asp He Val He Tyr Gin Tyr Met Ala Ala Leu Tyr Val Gly 
Ser Asp Leu Glu He Gly Gin His Arg Thr Lys He Glu Glu Leu Arg 
Gin His Leu Leu Arg Trp Gly Leu Thr Thr Pro Asp Lys Lys His Gin 
Lys Glu Pro Pro Phe Leu Trp Met Gly Tyr Glu Leu His Pro Asp Lys 
Trp Thr Val Gin Pro He Val Leu Pro Glu Lys Asp Ser Trp Thr Val 
Asn Asp He Gin Lys Leu Val Gly Lys Leu Asn Trp Ala Ser Gin He 
Tyr Pro Gly He Lys Val Arg Gin Leu Cys Lys Leu Leu Arg Gly Thr 
Lys Ala Leu Thr Glu Val He Pro Leu Thr Glu Glu Ala Glu Leu Glu 
Leu Ala Glu Asn Arg Glu He Leu Lys Glu Pro Val His Gly Val Tyr 
Tyr Asp Pro Ser Lys Asp Leu He Ala Glu He Gin Lys Gin Gly Gin 
"ly Gin Trp Thr Tyr Gin He Tyr Gin Glu Pro Phe Lys Asn Leu Lys 
Thr Gly Lys Tyr Ala Arg Met Arg Gly Ala His Thr Asn Asp Val Lys 
Gin Leu Thr Glu Ala Val Gin Lys He Thr Thr Glu Ser He Val He 
20 Trp Gly Lys Thr Pro Lys Phe Lys Leu Pro He Gin Lys Glu Thr Trp 
Glu Thr Trp Trp Thr Glu Tyr Trp Gin Ala Thr Trp He Pro Glu Trp 
Glu Phe Val Asn Thr Pro Pro Leu Val Lys Leu Trp Tyr Gin Leu Glu 
Lys Glu Pro He Val Gly Ala Glu Thr Phe Tyr Val Ala Gly Ala Ala 
Asn Arg Glu Thr Lys Leu Gly Lys Ala Gly Tyr Val Thr Asn Arg Gly 
25 Arg Gin Lys Val Val Thr Leu Thr Asp Thr Thr Asn Gin Lys Thr Ala 
Leu Gin Ala He Tyr Leu Ala Leu Gin Asp Ser Gly Leu Glu Val Asn 
He Val Thr Ala Ser Gin Tyr Ala Leu Gly He He Gin Ala Gin Pro 
Asp Gin Ser Glu Ser Glu Leu Val Asn Gin He He Glu Gin Leu He 
Lys Lys Glu Lys Val Tyr Leu Ala Trp Val Pro Ala His Lys Gly He 
30 Gly Gly Asn Glu Gin Val Asp Lys Leu Val Ser Ala Gly He Arg Lys 
Val Leu Phe Leu Asp Gly He Asp Lys Ala Gin Asp Glu His Glu Lys 
Tyr His Ser Asn Trp Arg Ala Met Ala Ser Asp Phe Asn Leu Pro Pro 
Val val Ala Lys Glu He Val Ala Ser Cys Asp Lys Cys Gin Leu Lys 
Gly Glu Ala Met His Gly Gin Val Asp Cys Ser Pro Gly He Trp Gin 
35 Leu Ala Cys Thr His Leu Glu Gly Lys Val He Leu Val Ala Val His 
Val Ala Ser Gly Tyr He Glu Ala Glu Val He Pro Ala Glu Thr Gly 



63 



wo 02/022080 



PCT/USOl/28861 



Gin Glu Thr Ala Tyr Phe Leu Leu Lys Leu Ala Gly Arg Trp Pro Val 
Lys Thr lie His Thr Ala Asn Gly Ser Asn Phe Thr Gly Ala Thr Val 
Arg Ala Ala Cys Trp Trp Ala Gly lie Lys Gin Glu Phe Gly lie Pro 
Tyr Asn Pro Gin Ser Gin Gly Val Val Ala Ser Met Asn Lys Glu Leu 
5 Lys Lys He He Gly Gin Val Arg Asp Gin Ala Glu His Leu Lys Thr 
Ala Val Gin Met Ala Val Phe He His Asn Phe Lys Arg Lys Gly Gly 
He Gly Gly Tyr Ser Ala Gly Glu Arg He Val Asp He He Ala Thr 
Asp He Gin Thr Lys Glu Leu Gin Lys Gin He Thr. Lys He Gin Asn 
Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asn Pro Leu Trp Lys Gly Pro 
10 Ala Lys" Leu Leu Trp Lys Gly Glu Gly Ala Val Val He Gin Asp Asn 
Ser Asp He Lys Val Val Pro Arg Arg Lys Ala Lys He He Arg Asp 
Tyr Gly Lys Gin Met Ala Gly Asp Asp Cys Val Ala Ser Arg Gin Asp 
Glu Asp (SEQ ID N0:4) . 

As noted above, it will be understood that any combination of the mutations 

15 disclosed above may be suitable and therefore be utilized as an lA-pol-based 

adenoviral HIV vaccine of the present invention, either when administered alone or in 
a combined modality regime and/or a prime-boost regimen. For example, it may be 
^ssible to mutate only 2 of the 3 residues within the respective reverse transcriptase, 
RNase-H, and integrase coding regions while still aboUshing these enzymatic 

20 activities. However, the lA-pol constmct described above and disclosed as SEQ ID 
N0:3, as well as the expressed protein (SEQ ID NO:4;) is preferred It is also 
preferred that at least one mutation be present in each of the three catalytic domains. 

Another aspect of this portion of the invention are codon optimized HIV-1 
Pol-based vaccine constructions which comprise a eukaryotic trafficldng signal 

25 peptide such as from tPA (tissue-type plasminogen activator) or by a leader peptide 
such as is found in highly expressed mammalian proteins such as immunoglobulin 
leader peptides. Any functional leader peptide may be tested for efficacy. However, 
a preferred embodiment of the present invention, as with HIV-1 Nef constructs shown 
herein, is to provide for a HIV-1 Pol mutant adenoviral vaccine construction wherein 

30 the pol coding region or a portion thereof is operatively linked to a leader peptide, 
preferably a leader peptide from human tPA. In other words, a codon optimized 
HIV-1 Pol mutant such as lA-Pol (SEQ ID N0:4) may also comprise a leader peptide 
at the amino terminal portion of the protein, which may effect cellular trafficking and 
hence, immunogenicity of the expressed protein within the host cell. As noted in 

35 Figure 16A-B, a DNA vector which may be utilized to practice the present invention 
may be modified by known recombinant DNA methodology to contain a leader signal 
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peptide of interest, such that downstream cloning of the modified HIV-1 protein of 
interest results in a nucleotide sequence which encodes a modified HTV-l tPA/Pol 
protein. In the alternative, as noted above, insertion of a nucleotide sequence which 
encodes a leader peptide may be inserted into a DNA vector housing the open reading 
5 frame for the Pol protein of interest. Regardless of the cloning strategy, the end result 
is a polynucleotide vaccine which comprises vector components for effective gene 
expression in conjunction with nucleotide sequences which encode a modified HIV-1 
Pol protein of interest, including but not Ihnited to a HIV-1 Pol protein which 
contains a leader peptide. The amino acid sequence of the human tPA leader utilized 
10 herein is as foUows: MDAMKRGLCCVLIJU:GAVFVSPSEISS (SEQ ID N0:17). 
Therefore, another aspect of the present invention is to generate HIV-1 Pol-based 
vaccine constructions which comprise a eukaryotic trafficking signal peptide such as 
from tPA. To tiiis end, tiie presemt invention relates to a DNA molecule which 
encodes a codon optimized wt-pol DNA construct wherein die protease (PR) activity 
15 is deleted and a human tPA leader sequence is fijsed to the 5' end of the coding region. 
A DNA molecule which encodes this protein is disclosed herein as SEQ ID N0:5, the 
open reading firame disclosed herein as SEQ ID N0:6. 

To this end, the present invention relates to a DNA molecule which encodes a 
codon optimized wt-pol DNA construct whenan tiie protease (PR) activity is deleted 
20 and a human tPA leader sequence is fused to the 5' end of die coding region ( herein, 
"tPA-wt-pol"). A DNA molecule which encodes tiiis protein is disclosed herein as 
SEQ ID NO:5, tiie open reading firame being contained from an initiating Met residue 
at nucleotides 8-10 to a termination codon firom nucleotides 2633-2635. SEQ ID 
N0:5 is as follows: 

25 GATCACCATC 6ATGCAATGA A6AGAGGGCT CTGCTGTGTG CTGCTGCTGT GTGGAGCAGT 
CTTCGTTTCG CCCAGCGAGA TCTCCGCCCC CATCTCCCCC ATTGAGACTG TGCCTGTGAA 
GCTGAAGCCT GGCATGGATG GCCCCAAGGT GAAGCAGTGG CCCCTGACTG AGGAGAAGAT 
CAAGGCCCTG GTGGAAATCT GCACTGAGAT GGAGAAGGAG GGCAAAATCT CCAAGATTGG 
CCCCGAGAAC CCCTACAACA CCCCTGTGTT TGCCATCAAG AAGAAG6ACT CCACCAAGTG 

30 GAGGAAGCTG GTGGACTTCA GGGAGCTGAA CAAGAGGACC CAGGACTTCT GGGAGGTGCA 
GCTGGGCATC CCCCACCCCG CTGGCCTGAA GAAGAAGAAG TCTGTGACTG TGCTG6ATCT 
GGGGGATGCC TACTTCTCTG TGCCCCTGGA T6AGGACTTC ASGAAGTACA CTGCCTTCAC 
CATCCCCTCC ATCAACAAT6 AGACCCCTGG CATCAGGTAC CAGTACAATG TGCTGCCCCA 
GGGCTGGAAG GGCTCCCCTG CCATCTTCCA GTCCTCCATG ACCAAGATCC TGGAGCCCTT 
35 CAGGAAGCAG AACCCTGACA TTGTGATCTA CCAGTACATG GATGACCTGT ATGTGGGCTC 
T6ACCTG6A6 ATTGGGCAGC ACAQGACCAA GATT6AGGA6 CTGAGGCAGC ACCTGCT6AG 
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GTGGGGCCTG ACCACCCCTG ACAAGAAGCA CCAGAAGGAG CCCCCCTTCC TGTGGATGGG 
CTATGAGCTG CACCCCGACA AGTGGACTGT GCAGCCCATT GTGCTGCCTG AGAAGGACTC 
CTGGACTGTG AATGACATCC AGAAGCTGGT GGGCAAGCTG AACTGGGCCT CCCAAATCTA 
CCCTGGCATC AAGGTGAGGC AGCTGTGCAA GCTGCTGAGG GGCACCAAGG CCCTGACTGA 
5 GGTGATCCCC CTGACTGAGG AGGCTGAGCT GGAGCTGGCT GAGAACAGGG AGATCCTGAA 
GGAGCCTGTG CATGGGGTGT ACTATGACCC CTCCAAGGAC CTGATTGCTG AGATCCAGAA 
GCAGGGCCAG GGCCAGTGGA CCTACCAAAT CTACCAGGAG CCCTTCAAGA ACCTGAAGAC 
TGGCAAGTAT GCCAGGATGA GGGGG6CCCA CACCAATGAT GT6AAGCAGC TGACTGAGGC 
TGTGCAGAAG ATCACCACTG AGTCCATTGT GATCTGGGGC AA6ACCCCCA AGTTCAAGCT 

10 GCCCATCCAG AAGGAGACCT GGGA6ACCTG GTG6ACTGAG TACTGGCAGG CCACCTGGAT 
CCCTGAGTGG 6AGTTTGTGA ACACCCCCCC CCTGGTGAAG CTGTGGTACC AGCTGGAGAA 
GGAGCCCATT GTGGGG6CTG AGACCTTCTA TGTGGATGGG GCTGCCAACA GGGAGACCAA 
GCTGGGCAAG GCTGGCTATG TGACCAACAG GGGCAGGCAG AAGGT6GTGA CCCTGACTGA 
CACCACCAAC CAGAAGACTG AGCTCCAGGC CATCTACCTG GCCCTCCAGG ACTCTGGCCT 

15 GGAGGTGAAC ATTGTGACTG ACTCCCAGTA TGCCCTGGGC ATCATCCAGG CCCAGCCTGA 
TCAGTCTGAG TCTGAGCTGG TGAACCAGAT CATTGAGCAG CTGATCAAGA AGGAGAA6GT 
GTACCTGGCC TGGGTGCCTG CCCACAAGGG CATTGGGGGC AATGAGCAGG TGGACAAGCT 
Gu-it5TCTGCT GGCATCAGGA AGGTGCTGTT CCTGGATGGC ATTGACAAGG CCCAGGATGA 
GCATGAGAAG TACCACTCCA ACTGGAGGGC TATGGCCTCT GACTTCAACC TGCCCCCTGT 

20 GGTGGCTAAG GAGATTGTGG CCTCCTGTGA CAAGTGCCAG CTGAAGGGGG AGGCCATGCA 
TGGGCAGGTG GACTGCTCCC CTGGCATCTG GCAGCTGGAC TGCACCCACC TGGAGQGCAA 
GGTGATCCTG GTGGCTGTGC ATGTGGCCTC CGGCTACATT GAGGCT6AGG TGATCCCTGC 
TGAGACAGGC CAGGAGACTG CCTACTTCCT GCTGAAGCTG GCTGGCAGGT GGCCTGTGAA 
GACCATCCAC ACTGACAATG GCTCCAACTT CACTGGGGCC ACAGTGAGGG CTGCCTGCTG 

25 GTGGGCTGGC ATCAAGCAGG AGTTTGGCAT CCCCTACAAC CCCCAGTCCC AGGGGGTGGT 
GGAGTCCATG AACAAGGAGC TGAAGAAGAT CATTGGGCAG GTGAGGGACC AGGCTGAGCA 
CCTGAAGACA GCTGTGCAGA TGGCTGTGTT CATCCACAAC TTCAAGAGGA AGGGGGGCAT 
CGGGGGCTAC TCCGCTGGGG AGAGGATTGT GGACATCATT GCCACAGACA TCCAGACCAA 
GGAGCTCCAG AAGCAGATCA CCAA6ATCCA GAACTTCAGG GTGTACTACA GGGACTCCAG 

30 GAACCCCCTG TGGAAGGGCC CTGCCAAGCT GCTGTGGAAG GGGGAGGGGG CTGTGGTGAT 
CCAGGACAAC TCTGACATCA AGGTGGTGCC CAGGAGGAAG GCCAAGATCA TCAGGGACTA 
TGGCAAGCAG ATGGCTGGGG ATGACT6TGT GGCCTCCAGG CAGGATGAGG ACTAAAGCCC 
GGGCAGATCT (SEQ ID N0:5) . 

The open reading frame of the wild type tP A-pol construct disclosed as SEQ 
35 ID N0:5 contains 875 amino acids, disclosed herein as SEQ ID NO:6, as follows: 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
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Ala Val Phe Val Ser Pro Ser Glu lie Ser Ala Pro lie Ser Pro He 
Glu Thr Val Pro Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val 
Lys Gin Trp Pro Leu Thr Glu Glu Lys lie Lys Ala Leu Val Glu He 
Cys Thr Glu Met Glu Lys Glu Gly Lys He Ser Lys He Gly Pro Glu 
5 Asn Pro Tyr Asn Thr Pro Val Phe Ala He Lys Lys Lys Asp Ser Thr 
Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gin 
Asp Phe Trp Glu Val Gin Leu Gly He Pro His Pro Ala Gly Leu Lys 
Lys Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser 
Val Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr He Pro 
10 Ser He Asn Asn Glu Thr Pro Gly He Arg Tyr Gin Tyr Asn Val Leu 
Pro Gin Gly Trp Lys Gly Ser Pro Ala He Phe Gin Ser Ser Met Thr 
Lys He Leu Glu Pro Phe Arg Lys Gin Asn Pro Asp He Val He Tyr 
Gin Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu He Gly Gin 
His Arg Thr Lys He Glu Glu Leu Arg Gin His Leu Leu Arg Trp Gly 
15 Leu Thr Thr Pro Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu Trp 
Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gin Pro He Val 
T,eu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp He Gin Lys Leu Val 
Gly Lys Leu Asn Trp Ala Ser Gin He Tyr Pro Gly He Lys Val Arg 
Gin Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val He 
20 Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu He 
Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 
He Ala Glu He Gin Lys Gin Gly Gin Gly Gin Trp Thr Tyr Gin He 
Tyr Gin Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met 
Arg Gly Ala His Thr Asn Asp Val Lys Gin Leu Thr Glu Ala Val Gin 
25 Lys He Thr Thr Glu Ser He Val He Trp Gly Lys Thr Pro Lys Phe 
Lys Leu Pro He Gin Lys Glu Thr Trp Glu Thr Trp Trp Thr Glu Tyr 
Trp Gin Ala Thr Trp He Pro Glu Trp Glu Phe Val Asn Thr Pro Pro 
Leu Val Lys Leu Trp Tyr Gin Leu Glu Lys Glu Pro He Val Gly Ala 
Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly 
30 Lys Ala Gly Tyr Val Thr Asn Arg Gly Arg Gin Lys Val Val Thr Leu 
Thr Asp Thr Thr Asn Gin Lys Thr. Glu Leu Gin Ala lie Tyr Leu Ala 
Leu Gin Asp Ser Gly Leu Glu Val Asn He Val Thr Asp Ser Gin Tyr 
Ala Leu Gly He He Gin Ala Gin Pro Asp Gin Ser Glu Ser Glu Leu 
Val Asn Gin He He Glu Gin Leu He Lys Lys Glu Lys Val Tyr Leu 
35 Ala Trp Val Pro Ala His Lys Gly He Gly Gly Asn Glu Gin Val Asp 
Lys Leu Val Ser Ala Gly He Arg Lys Val Leu Phe Leu Asp Gly He 



67 



wo 02/022080 PCT/USOl/28861 



Asp Lys Ala Gin Asp Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 
Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu He Val 
Ala Ser Cys Asp Lys Cys Gin Leu Lys Gly Glu Ala Met His Gly Gin 
Val Asp Cys Ser Pro Gly He Trp Gin Leu Asp Cys Thr His Leu Glu 
5 Gly Lys Val He Leu Val Ala Val His Val Ala Ser Gly Tyr He Glu 
Ala Glu Val He Pro Ala Glu Thr Gly Gin Glu Thr Ala Tyr Phe Leu 
Leu Lys Leu Ala Gly Arg Trp Pro Val Lys l»hr He His Thr Asp Asn 
Gly Ser Asn Phe Thr Gly Ala Thr Val Arg Ala Ala Cys Trp Trp Ala 
Gly He Lys Gin Glu Phe Gly He Pro Tyr Asn Pro Gin Ser Gin Gly 

10 Val val Glu Ser Met Asn Lys Glu Leu Lys Lys He He Gly Gin Val 
Arg Asp Gin Ala Glu His Leu Lys Thr Ala Val Gin Met ;a.a Val Phe 
He His Asn Phe Lys Arg Lys Gly Gly He Gly Gly Tyr Ser Ala Gly 
Glu Arg He Val Asp He He Ala Thr Asp He Gin Thr Lys Glu Leu 
Gin Lys Gin He Thr Lys He Gin Asn Phe Arg Val Tyr Tyr Arg Asp 

15 Ser Arg Asn Pro Leu Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 
Glu Gly Ala Val Val He Gin Asp Asn Ser Asp He Lys Val Val Pro 
Arg Arg Lys Ala Lys He He Arg Asp Tyr Gly Lys Gin Met Ala Gly 
**sp Asp Cys Val Ala Ser Arg Gin Asp Glu Asp (SEQ ID N0;6) . 

The present invention also relates to a codon optimized HIV-1 Pol mutant 

20 contained within a recombinant adenoviral vector such as lA-Pol (SEQ ID N0:4) 
which comprises a leader peptide at the amino terminal portion of die protein, which 
may effect cellular trafficking and hence^ imraunogenicity of the expressed protein 
within the host cell. Any such adenoviral-based mV-1 DNA pol mutant disclosed in 
the above paragraphs is suitable for fusion downstream of a leader peptide, such as a 

25 leader peptide including but not limited to the human tPA leader sequence. Therefore, 
any such leader peptide-based HIV-1 pol mutant construct may include but is not 
limited to a mutated DNA molecule which effectively alters the catalytic activity of 
the RT, RNase and/or IN region of the expressed protein, resulting in at least 
substantially decreased enzymatic activity one or more of the RT, RNase H and/or IN 

30 functions of HHV-l Pol. In a preferred embodiment of this portion of die invention, a 
leader peptide«3IV-l DNA pol construct contains a mutation or-mutations within the 
Pol coding region which effectively abolishes RT, RNase H and IN activity. An 
especiaUy preferable HIV-1 DNA pol construct is a DNA molecule which contains at 
least one point mutation which alters the active site and catalytic activity within the 

35 RT, RNase H and IN domains of Pol, such tiiat each activity is at least substantially 
aboUshed, and preferably totally abolished. Such a HIV-1 Pol mutant will most likely 
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comprise at least one point mutation in or around each catalytic domain responsible for 
RT, RNase H and IN activity, respectfully. An especially preferred embodiment of 
this portion of the invention relates to a human tPA leader fused to the lA-Pol protein 
comprising the nine mutations shown in Table 1. The DNA molecule is disclosed 
herein as SEQ ID N0:7 and the expressed tPA-IA Pol protein comprises a fusion 
junction as shown in Figure 18, The complete amino acid sequence of the expressed 
protein is set forth in SEQ ID N0:8. To this end, SEQ ID N0:7 discloses the 
nucleotide sequence which codes for a human tPA leader fused to the lA Pol protein 
comprising the nine mutations shown in Table 1 (herein, "tPA-opt-IApol"). The open 
reading frame begins with the initiating Met (nucleotides 8-10) and ternunates with a 
"TAA" codon at nucleotides 2633-2635. The nucleotide sequence encoding tPA- 
lAPol is also disclosed as foUows: 

GATCACCATG GATGCAATGA AGA6AGGGC?r CTGCTGTGTG CTGCTGCTGT GTGGAGCAGT 
CTTCGTTTCG CCCAGCGAGA TCTCCGCCCC CATCTCCCCC ATTGAGACTG TGCCTGTGAA 
GCTGAAGCCT GGCATGGATG GCCCCAAGGT GAAGCAGTGG CCCCT6ACTG AGGAGAAGAT 
CAAGGCCCTG GTGGAAATCT GCACTGAGAT GGAGAAGGAG GGCAAAATCT CCAAGATTGG 
CCCCGAGAAC CCCTACAACA CCCCTGTGTT TGCCATCAAG AAGAAGGACT CCACCAAGTG 
GAGGAAGCTG GTGGACTTCA GGGAGCTGAA CAAGAGGACC CAGGACTTCT GGGAGGTGCA 
GCTGGGCATC CCCCACCCCG CTGGCCTGAA GAAGAAGAAG TCTGTGACTG TGCTGGCTGT 
GGGGGATGCC TACTTCTCTG TGCCCCTGGA TGAGGACTTC AGGAAGTACA CTGCCTTCAC 
CATCCCCTCC ATCAACAATG AGACCCCTGG CATCAGGTAC CAGTACAATG TGCTGCCCCA 
GGGCTGGAAG GGCTCCCCT6 CCATCTTCCA GTCCTCCAT6 ACCAAGATCC TGGAGCCCTT 
CAGGAAGCAG AACCCTGACA TTGTGATCTA CCAGTACATG GCTGCCCTGT ATGTGGGCTC 
TGACCTGGAG ATTGGGCAGC ACAGGACCAA GATTGAGGAG CTGAGGCAGC ACCT6CTGAG 
GTGGGGCCT6 ACCACCCCTG ACAAGAAGCA CCAGAAGGAG CCCCCCTTCC TGTGGATGGG 
CTATGAGCTG CACCCCGACA AGTGGACTGT GCAGCCCATT GTGCTGCCTG AGAAGGACTC 
CTGGACTGTG AATGACATCC AGAAGCTGGT GGGCAAGCTG AACTGGGCCT CCCAAATCTA 
CCCTGGCATC AAGGTGAGGC AGCTGTGCAA GCTGCTGAGG GGCACCAAGG CCCTGACTGA 
GGTGATCCCC CTGACTGAGG AGGCTGAGCT GGAGCTGGCT GAGAACAGGG AGATCCTGAA 
GGAGCCT6TG CATGGGGTGT ACTATGACCC CTCCAAGGAC CTGATTGCTG AGATCCAGAA 
GCAGGGCCAG GGCCAGTGGA CCTACCAAAT CTACCAGGAG CCCTTCAAGA ACCTGAAGAC 
TGGCAAGTAT 6CCAGGATGA GGGGGGCCCA CACCAATGAT GTGAAGCAGC TGACT6AGGC 
TGTGCAGAAG ATCACCACTG AGTCCATTGT GATCTGGGGC AAGACCCCCA AGTTCAAGCT 
GCCCATCCAG AAGGAGACCT GG6AGACCTG GTGGACTGAG TACTGGCAGG CCACCTGGAT 
CCCTGAGTGG GAGTTTGTGA ACACCCCCCC CCTGGTGAAG CTGTGGTACC AGCTGGAGAA 
GGAGCCCATT GTGGGGGCTG AGACCTTCTA TGTGGCTGGG GCTGCCAACA GGGAGACCAA 
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GCTGGGCAAG 


GCTGGCTATG 


TGACCAACAG 


GGGCAGGCAG 


AAGGTGGTGA 


CCCTGACTGA 




CACCACCAAC 


CAGAAGACTG 


CCCTCCAGGC 


CATCTACCTG 


GCCCTCCAGG 


ACTCTGGCCT 




GGAGGT6AAC 


ATTGTGACTG 


CCTCCCAGTA 


TGCCCTGGGC 


ATCATCCAGG 


CCCAGCCTGA 




TCAGTCTGAG 


TCTGAGCTGG 


TGAACCAGAT 


CATTGAGCAG 


CTGATCAAGA 


AGGAGAAGGT 


5 


GTACCTGGCC 


TGGGTGCCTG 


CCCACAAGGG 


CATTGGGGGC 


AATGAGCAGG 


TGGACAAGCT 




G6TGTCTGCT 


G6CATCAGGA 


AGGTGCTGTT 


CCTGGATGGC 


ATTGACAAGG 


CCCAGGATGA 




GCATGAGAA6 


TACCACTCCA 


ACTGGAGG6C 


TATGGCCTCT 


GACTTCAACC 






GGTGGCTAAG 


GAGATTGTGG 


CCTCCTGTGA 


CAAGTGCCAG 


CTGAAGGGCC 


ACGPr ATfiP A 




TGGGCAGGTG 


GACTGCTCCC 


CTGGCATCTG 


GCAGCTGGCC 




TfiC A<?n/2P A a 


10 


GGTGATCCTG 


GTGGCTGTGC 


ATGTGGCCTC 


CGGCTACATT 








TGAGACAGGC 


CAGGAGACTG 


CCTACTTCCT 


GCTGAAGCTG 


GCTGGCAGCT 


/3/2ppaY21V2 A A 




GACCATCCAC 


ACTGCCAATG 


GCTCCAACTT 


CACTGGGGCC 




V« J. 1 AjVi.. 1 V9 




GTGGGCTGGC 


ATCAAGCAGG 


AGTTTGGCAT 


CCCCTACAAC 


CCCCAGTCrC 


At^f^f^PPTV^PT 




GGCCTCCATG 


TkACAAGGAGC 








AGGCTGAGCA 


15 


CCTGAAGACA 


GCTGT6CAGA 


TGGCTGTGTT 


CATCCACAAC 


TTCAAGAGGA 


AGGGGGGCAT 




CGG6GGCTAC 


TCCGCTGGGG 


AGAGGATTGT 


GGACATCATT 


GCCACAGACA 


TCCAGACCAA 




GGAGCTCCAG 


AAGCAGATCA 


CCAAGATCCA 


GAACTTCAGG 


GTGTACTACA 


GGGACTCCAG 




...^ACCCCCTG 


TGGAAGGGCC 


CTGCCAAGCT 


GCTGTGGAAG 


GGGGAGGGGG 


CTGTGGTGAT 




CCAGGACAAC 


TCTGACATCA 


AGGTGGTGCC 


CAGGAGGAAG 


GCCAAGATCA 


TCAGGGACTA 


20 


TGGCAAGCAG 


AT6GCTGGGG 


ATGACTGTGT 


GGCCTCCA6G 


CAGGATGAGG 


ACTAAAGCCC 



GGGCAGATCT (SEQ ID N0:7). 

The open reading frame of the tPA-IA-pol construct disclosed as SEQ ID 
N0:7 contains 875 amino acids, disclosed herein as tPA-IA-Pol and SEQ ID N0:8, as 
follows: 



Met 


Asp 


Ala 


Het 


Lys 


Arg 


Gly 


Leu 


Cys 


Cys Val 


Leu 


Leu 


Leu 


Cys 


Gly 


Ala 


Val 


Phe 


Val 


Ser 


Pro 


Ser 


Glu 


He 


Ser Ala 


Pro 


He 


Ser 


Pro 


He 


Glu 


Thr 


Val 


Pro 


Val 


Lys 


Leu 


Lys 


Pro 


Gly Met 


Asp 


Gly 


Pro 


Lys 


Val 


Lys 


Gin 


Trp 


Pro 


Leu 


Thr 


Glu 


Glu 


Lys 


He Lys 


Ala 


Leu 


Val 


Glu 


He 


Cys 


Thr 


Glu 


Met 


Glu 


Lys 


Glu 


Gly 


Lys 


He Ser 


Lys 


He 


Gly 


Pro 


Glu 


Asn 


Pro 


Tyr 


Asn 


Thr 


Pro 


Val 


Phe 


Ala 


He Lys 


Lys 


Lys 


Asp 


Ser 


Thr 


Lys 


Trp 


Arg 


Lys 


Leu 


Val 


Asp 


Phe 


Arg- 


Glu Leu 


Asn 


Lys 


Arg 


Thr 


Gin 


Asp 


Phe 


Trp 


Glu 


Val 


Gin 


Leu 


Gly 


He 


Pro His 


Pro 


Ala 


Gly 


Leu 


Lys 


Lys 


Lys 


Lys 


Ser 


Val 


Thr 


Val 


Leu 


Ala 


Val Gly 


Asp 


iMa 


Tyr 


Phe 


Ser 


Val 


Pro 


Leu 


Asp 


Glu 


Asp 


Phe 


Arg 


Lys 


Tyr Thr 


Ala 


Phe 


Thr 


He 


Pro 


Ser 


He 


Asn 


Asn 


Glu 


Thr 


Pro 


Gly 


He 


Arg Tyr 


Gin 


Tyr 


Asn 


Val 


Leu 


Pro 


Gin 


Gly 


Trp 


Lys 


Gly 


Ser 


Pro 


Ala 


He Phe 


Gin 


Ser 


Ser 


Met 


Thr 
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Lys lie Leu Glu Pro Phe Arg Lys Gin Asn Pro Asp lie Val He Tyr 
Gin Tyr Met Ala Ala Leu Tyr Val Gly Ser Asp Leu Glu He Gly Gin 
His Arg Thr Lys He Glu Glu Leu Arg Gin His Leu Leu Arg Trp Gly 
Leu Thr Thr Pro Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu Trp 
5 Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gin Pro He Val 
Leu pro Glu Lys Asp Ser Trp Thr Val Asn Asp He Gin Lys Leu Val 
Gly Lys Leu Asn Trp Ala Ser Gin He Tyr Pro Gly He Lys Val Arg 
Gin Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val He 
Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu He 
10 Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 
He Ala Glu He Gin Lys Gin Gly Gin Gly Gin Trp Thr Tyr Gin He 
Tyr Gin Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met 
Arg Gly Ala His Thr Asn Asp Val Lys Gin Leu Thr Glu Ala Val Gin 
Lys He Thr Thr Glu Ser He Val He Trp Gly Lys Thr Pro Lys Phe 
15 Lys Leu Pro He Gin Lys Glu Thr Trp Glu Thr Trp Trp Thr Glu Tyr 
Trp Gin Ala Thr Trp He Pro Glu Trp Glu Phe Val Asn Thr Pro Pro 
Leu val Lys Leu Trp Tyr Gin Leu Glu Lys Glu Pro He Val Gly Ala 
Glu Thr Phe Tyr Val Ala Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly 
Lys Ala Gly Tyr Val Thr Asn Arg Gly Arg Gin Lys Val Val Thr Leu 
20 Thr ASP Thr Thr Asn Gin Lys Thr Ala Leu Gin Ala He Tyr Leu Ala 
Leu Gin Asp Ser Gly Leu Glu Val Asn He Val Thr Ala Ser Gin Tyr 
Ala Leu Gly He He Gin Ala Gin Pro Asp Gin Ser Glu Ser Glu Leu 
val Asn Gin He He Glu Gin Leu He Lys Lys Glu Lys Val Tyr Leu 
Ala Trp Val Pro Ala His Lys Gly He Gly Gly Asn Glu Gin Val Asp 
25 Lys Leu Val Ser Ala Gly He Arg Lys Val Leu Phe Leu Asp Gly He 
Asp Lys Ala Gin Asp Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 
Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu He Val 
Ala ser Cys Asp Lys Cys Gin Leu Lys Gly Glu Ala Met His Gly Gin 
val Asp Cys Ser Pro Gly He Trp Gin Leu Ala Cys Thr His Leu Glu 
30 Gly Lys Val He Leu Val Ala Val His Val Ala Ser Gly Tyr He Glu 
Ala Glu val He Pro Ala -Glu Thr Gly Gin Glu Thr Ala IVr Phe. Leu 
Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr He His Thr Ala Asn 
Gly Ser Asn Phe Thr Gly Ala Thr Val Arg Ala Ala Cys Trp Trp Ala 
Gly He Lys Gin Glu Phe Gly He Pro Tyr Asn Pro Gin Ser Gin Gly 
35 val val Ala Ser Met Asn Lys Glu Leu Lys Lys He He Gly Gin Val 
Arg Asp Gin Ala Glu His Leu Lys Thr Ala Val Gin Met Ala Val Phe 
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He His Asn Phe Lys Arg Lys Gly Gly lie Gly Gly Tyr Ser Ala Gly 
Glu Arg He Val Asp He He Ala Thr Asp lie Gin Thr Lys Glu Leu 
Gin Lys Gin He Thr Lys He Gin Asn Phe Arg Val Tyr Tyr Arg Asp 
Ser Arg Asn Pro Leu Tip Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 
5 Glu Gly Ala Val Val He Gin Asp Asn Ser Asp He Lys Val Val Pro 
Arg Arg Lys Ala Lys He He Arg Asp Tyr Gly Lys Gin Met Ala Gly 
Asp Asp Qys Val Ala Ser Arg Gin Asp Glu Asp (SEQ lb N0:8) . 

EXAMPUE18 

10 CODON OPTIMIZED fflV-l NEF AND CODON OPTIMIZED 

mv-l NEF MODIFICATIONS 
Codon optimized version of HIV-l Nef and HIV-1 Nef modifications arc 
essentially as described in U.S. Application Serial No. 09/738,782, filed 
Decemba- 15, 2000 and PCT International Application PCT/USOO/34162, also filed 

15 December 15, 2000, both documents which arc hereby incorporated by reference. As 
disclosed within the above-mentioned documents, particular embodiments of codon 
optimized Nef and Nef modifications relate to a DNA molecule encoding HIV-1 Nef 
*~wn the HIV-l jfrl isolate wherein tiie codons are optimized for expression in a 
mammalian system such as a human. The DNA molecule which encodes this protein 

20 is disclosed herein as SEQ ID N0:9. while the expressed open reading fiame is 
disclosed herein as SEQ ID NO: 10. Another embodiment of Nef-based coding 
regions for use in the adenoviral vectors of the present invention comprise a codon 
optimized DNA molecule encoding a protein containing the human plasminogen 
activator (tpa) leader peptide fused witii the NHz-terminus of the MV-l Nef 

25 polypeptide. The DNA molecule which encodes this protein is disclosed herein as 
SEQ ID NO:ll, while the expressed open reading frame is disclosed hwein as SEQ 
ID NO: 12. Another modified Nef optimized coding region relates to a DNA molecule 
encoding optimized HIV-1 Nef wherein the open reading frame codes for 
modifications at die amino terminal myristylation site (Gly-2 to Ala-2) and 

30 substitution of the Leu-174-Leu-175 dileucine motif to Ala-174-Ala-175. herein 

described as opt nef (G2A, LLAA). The DNA molecule which encodes this protein is 
disclosed herein as SEQ ID NO:13, while die expressed open reading frame is 
disclosed herein as SEQ ID N0:14. An additional embodiment relates to a DNA 
molecule encoding optimized HTV-l Nef wherein the amino terminal myristylation 

35 site and dDeucine motif have been deleted, as weU as comprising a tPA leader peptide. 
This DNA molecule, opt tpanef (LLAA), comprises an open reading frame which 
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encodes a Nef protein containing a tPA leader sequence fused to amino acid residue 
6-216 of HTV-l Nef (jM), wherein Leu-174 and Uu-175 are substituted with Ala-174 
and Ala-175, herein referred to as opt tpanef (LLAA) is disclosed herein as SEQ ID 
N0:15, while the expressed open reading frame is disclosed herein as SEQ ID NO:16. 

5 As disclosed in the above-identified documents (U.S. AppUcation Serial No. 

09^738,782 and PCX Intranational Application PCTAJSOO/34162) and reiterated 
herein, the following nef-based nucleotide and amino acid sequences which conqoise 
the respective open reading frame are as follows: 

1 The nucleotide sequence of the codon optimized version of HIV-1 jrfl 

10 nef gene is disclosed herein as SEQ ID N0:9. as shown herein: 

GATCT6CCAC CAT6GGC6GC AAGTGGTCCA AGAGGTCCGT GCCCGGCTGG TCCACCGTGA 
GGGA6AGGAT 6AGGAGGGCC GAGCCCGCCG CCGACAGGGT GAGGAGGACC GAGCCCGCCG 
CCGTGGGCGT GGGCGCCGTG TCCAGGGACC TGGAGAAGCA CGGCGCCATC ACCTCCTCCA 
ACACCGCCGC CACCAACGCC GACTGCGCCT GGCTGGAGGC CCAGGAGGAC GAGGAGGTGG 

15 GCTTCCCCGT GAGGCCCCAG GTOCCCCTGA GGCCCATGAC CTACAAGGGC GCCGTG6ACC 
TGTCCCACTT CCTGAAGGAG AAGGGCGGCC TGGAGGGCCT GATCCACTCC CAGAAGAGGC 
AGGACATCCT GGACCTGT6G GT6TACCACA CCCAGGGCTA CTTCCCCGAC TGGCAGAACT 
.HCACCCCCGG CCCCGGCATC AGGTTCCCCC TGACCTTCGG CTGGTGCTTC AAGCTGGTGC 
CCGTGGAGCC C6AGAAGGTG SAGGAGGCCA ACGAGGGCGA GAACAACTGC CTGCTGCACC 

20 CCATGTCCCA GCACG6CATC GAGGACCCC6 A6AAGGAGGT GCTGGAGTGG AGGTTCGACT 
CCAAGCTG6C CTTCCACCAC GTGGCCAGGG ■ AGCTGCACCC CGAGTACTAC AAGGACTGCT 
AAAGCCCGGG C (SEQ ID N0:9) . 

Prefeired codon usage is as follows: Met (ATG), Gly (GGC), Lys (AAG). 
Trp (TGG). Ser (TCC), Arg (AGG), Val (GTG), Pro (CCC), Thr (ACC), Glu (GAG); 

25 Uu (CTGXBis (CAC), lie (ATC), Asn (AAC), Cys CTGC), Ala (GCC), Gin (CAG), 
Phe (TTC) and Tyr (TAG). For an additional discussion relating to mammalian 
(human) codon optimization, see WO 97/31115 (PCrAJS97/02294). which is hereby 
incorporated by reference. See also Figure 19A-B for a comparion of wild type vs. 
codon optimized nucleotides comprising the open reading frame of HIV-Nef. 

30 The open reading frame for SEQ ID N0:9 above comprises an initiating 

methionine-iesidue at nucleotides 12-14 and a 'TAA" stop codon from nucleotides 
660-662. The open reading frame of SEQ ID N0:9 provides for a 216 amino acid 
HIV-1 Nef protein expressed through utilization of a codon optimized DNA vaccine 
vector. The 216 amino acid HIV-1 Nef Qfrl) protein is disclosed herein as SEQ ID 

35 NO:10, and as follows: 

Met Gly Gly Lys Trp Ser Lys Arg Ser Val Pro Gly Trp Ser Thr Val 
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Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Arg Val Arg Arg 
Thr Glu Pro Ala Ala Val Gly Val Gly Ala val Ser Arg Asp Leu Glu 
Lys His Gly Ala lie Thr Ser Ser Asn Thr Ala Ala Thr Asn Ala Asp 
Cys Ala Trp Leu Glu Ala Gin Glu Asp Glu Glu Val Gly Phe Pro Val 
5 Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Val Asp 
Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He His 
Ser Gin Lys Arg Gin Asp lie Leu Asp Leu Trp Val Tyr His Thr Gin 
Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly He Arg 
Phe Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro 
10 Glu Lys Val Glu Glu Ala Asn Glu Gly Glu Asn Asn Cys Leu Leu His 
Pro Met Ser Gin His Gly He Glu Asp Pro Glu Lys Glu Val Leu Glu 
Trp Arg Phe Asp Ser Lys .Leu Ala Phe His His Val Ala Arg Glu Leu 
His Pro Glu Tyr Tyr Lys Asp Cys (SEQ ID WO: 10) . 

HIV-1 Nef is a 216 amino add cytosolic protein which associates with the 

15 inner surface of the host ceD plasma membrane through myristylation of Gly-2 

(Franchini et al., 1986, Virology 155: 593-599). While not all possible Nef functions 
have been elucidated, it has become clear that correct trafficldng of Nef to the inner 
-Nsma membrane promotes viral replication by altering the host intracellular 
environment to faciUtate the early phase of the fflV-l life cycle and by increasing the 

20 infectivity of progeny viral particles. In one aspect of the invention regarding 
codon-optimized, protein-modified polypeptides, the nef-encoding region of the 
adenovinis vector of the present invention is modified to contain a nucleotide 
sequence which encodes a heterologous leader peptide such that the amino tcirminal 
region of the expressed protein will contain the leader peptide. The diversity of 

25 function that typifies eukaryotic cells depends upon the structural differentiation of 
their membrane boundaries. To generate and maintain these structures, proteins must 
be transported from their site of synthesis in the endoplasmic reticulum to 
predetennined destinations throughout the cell. This requires that the trafficking 
proteins display sorting signals that are recognized by the molecular machinery 

30 responsible for route selection located at the access points to the main trafficldng 
pathways. Sorting decisions for most proteins need to be made only once as they 
traverse their biosynthetic pathways since their final destination, the cellular location 
at which they perform their function, becomes their permanent residence. 
Maintenance of intraceUular integrity depends in part on the selective sorting and 

35 accurate transport of proteins to their correct destinations. Defined sequence motifs 
exist in proteins which can act as 'address labels\ A number of sorting signals have 
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been found associated with the cytoplasmic domains of membrane proteins. An 
effective induction of CTL responses often required sustained, high level endogenous 
expression of an antigen. As membrane-association via myristylation is an essential 
requirement for most of Nefs function, mutants lacking myristylation, by glycine-to- 
alanine change, change of the dileucine motif and/or by substitution with a tpa leader 
sequence as described herein, will be functionally defective, and therefore will have 
improved safety profile compared to wild-type Nef for use as an HIV-1 vaccine 
component 

In another embodiment of this portion of the invention, either the DNA vector 
or the HIV-1 nef nucleotide sequence is modified to include the human tissue-specific 
plasminogen activator (tPA) leader. As shown in Figure 16A-B, a DNA vector may 
be modified by known recombinant DNA methodology to contain a leader signal 
peptide of interest, such that downstream cloning of the modified HTV-l protein of 
interest results in a nucleotide sequence which encodes a modified HIV-l tPA/Nef 
protein. In the altemative, as noted above, insertion of a nucleotide sequence which 
encodes a leader peptide may be inserted into a DNA vector housing the open reading 
frame for the Nef protein of interest Regardless of the cloning strategy, the end result 
is a polynucleotide vaccine which comprises vector components for effective gene 
expression in conjunction with nucleotide sequences which encode a mo^fied HTV-l 
Nef protein of interest, including but not limited to a HIV-1 Nef protein which 
contains a leader peptide. The amino add sequence oftiie human tP A leader utilized 
herein is as follows: MDAMKRGLCCVLLLCGAVFVSPSEISS (SEQ ID N0:17), 

It has been shown that myristylation of Gly-2 in conjunction witii a dileucine 
motif in the carboxy region of the protein is essential for Nef-induced down 
regulation of CD4 (Aiken et al., 1994. Cell 76: 853-864) via endocytosis. It has also 
been shown that Nef expression promotes down regulation of MHCI (Schwartz et al., 
1996, Nature Medicine 2(3): 338-342) via endocytosis. The present invention relates 
in part to DNA vaccines which encode modified Nef proteins altered in trafficking 
and/or functional properties. The modifications introduced into the adenoviral vector 
HTV vaccines of the present invention include but are not limited to additions, 
deletions or substitutions to the nef open reading frame which results in the 
expression of a modified Nef protein which includes an amino terminal leader 
peptide, modification or deletion of the amino terminal myristylation site, and 
modification or deletion of tiie dileucine motif within the Nef protein and which alter 
function within the infected host cell. Therefore, a central theme of tiie DNA 
molecules and recombinant adenoviral HTV vaccines of the present invention is (1) 
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host administration and intracellular delivery of a codon optimized nef-based 
adenoviral HTV vaccine; (2) expression of a modified Nef protein which is 
inmiunogenic in terms of eliciting both CTL and Th responses; and, (3) inhibiting or 
at least altering known early viral functions of Nef which have been shown to 
5 promote HTV-l replication and load within an infected host. Therefore, the nef 

coding region may be altered, resulting in a DNA vaccine which expresses a modified 
Nef protein wherein the amino terminal Gly-2 myristylation residue is either deleted 
or modified to express altemate amino add residues. Also, the nef coding region may 
be altered so as to result in a DNA vaccine which expresses a modified Nef protein 

10 wherein the dileucine motif is either deleted or modified to express altemate amino 
acid residues. In addition, the adenoviral vector HIV vaccines of the present 
invention also relate to an isolated DNA molecule, regardless of codon usage, which 
expresses a wild type or modified Nef protein as described herein, including but not 
limited to modified Nef proteins which comprise a deletion or substitution of Gly 2, a 

15 deletion or substitution of Leu 174 and Leu 175 and/or inclusion of a leader sequence. 
Therefore, specific Nef-based constructs further include the following, as 
exemplification's and not limitations. For example, the present invention relates to an 
adenoviral vector vaccine which encodes modified forais of HTV-l, an open reading 
frame which encodes a Nef protein which comprises a tPA leader sequence fused to 

20 amino acid residue 6-216 of HIV-1 Nef (jfirl) is referred to herein as opt tpanef . The 
nucleotide sequence comprising the open reading fi:ame of opt tpanef is disclosed 
herein as SEQ ID NOcll, as shown below: 





CATGGATGCA 


ATGAAGAGAG 


» 

GGCTCTGCT6 


TGTGCTGCTG 


CTGTGTG6AG 


CAGTCTTCGT 




TTCGCCCAGC 


GAGATCTCCT 


CCAAGAGGTC 


CGTGCCCGGC 


TGGTCCACCG 


TGAGGGA6AG 


25 


GATGAGGAGG 


GCCGAGCCCG 


CCGCCGACAG 


GGTGAGGAGG 


ACCGAGCCCG 


CCGCCGTG6G 




CGTGGGCGCC 


GTGTCCAGGG 


ACCTGGAGAA 


GCACGGCGCC 


ATCACCTCCT 


CCAACACCGC 




CGCCACCAAC 


GCCGACTGCG 


CCTGGCTGGA 


GGCCCAGGAG 


GACGAGGAGG 


TGGGCTTCCC 




CGTGAGGCCC 


CAGGTGCCCC 


TGAGGCCCAT 


GACCTACAAG 


GGCGCCGTGG 


ACCTGTCCCA 




CTTCCTGAAG 


GAGAAG6GCG 


GCCT6GAGG6 


CCTGATCCAC 


TCCCAGAAGA 


GGCAGGACAT 


30 


CCTGGACCTG 


TGGGTGTACC 


ACACCCAGGG 


CTACTTCCCC 


(^CTGGCAGA 


ACTACACCCC 




CGGCCCCGGC 


ATCAGGTTCC 


CCCTGACCTT 


CGGCTGGTGC 


TTCAAGCTGG- 


TGCCCGTGGA 




GCCCGAGAA6 


GTGGAGGA66 


CCAACGAGGG 


CGAGAACAAC 


TGCCTGCTGC 


ACCCCAT6TC 




CCAGCACGGC 


ATCGAGGACC 


CCGAGAAGGA 


GGTGCTGGAG 


TGGAGGTTCG 


ACTCCAAGCT 




GGCCTTCCAC 


CACGTGGCCA 


GGGA6CT6CA 


CCCCGAGTAC 


TACAAGGACT 


GCTAAAGCC 


35 


(SEQ ID NO: 













The open reading frame for SEQ ID NO:l 1 comprises an initiating methionine 
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residue at nucleotides 2-4 and a 'TAA" stop codon from nucleotides 713-715. The 
open reading frame of SEQ ID N0:3 provides for a 237 amino add EHV-l Nef 
protein which comprises a tPA leader sequence fused to amino acids 6-216 of HIV-1 
Nef, including the dileucine motif at amino acid residues 174 and 175. This 237 
5 amino acid tPA/Nef (jfrl) fusion protein is disclosed herein as SEQ ID N0:12. and is 
shown as follows: 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
Ala Val Phe Val Ser Pro Ser Glu lie Ser Ser Lys Arg Ser Val Pro 
Gly Trp Ser Thr Val Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala 

10 Asp Arg Val Arg Arg Thr Glu Pro Ala Ala Val Gly Val Gly Ala Val 
Ser Arg Asp Leu Glu Lys His Gly Ala Xle Thr Ser Ser Asn Thr Ala 
Ala Thr Asn Ala Asp Cys Ala Trp Leu Glu Ala Gin Glu Asp Glu Glu 
Val Gly Phe Pro Val Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr 
Lys Gly Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu 

15 Glu Gly Leu He His Ser Gin Lys Arg Gin Asp He Leu Asp Leu Trp 
val Tyr His Thr Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tjrr Thr Pro 
•riy Pro Gly He Arg Phe Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu 
Val Pro Val Glu Pro Glu Lys Val Glu Glu Ala Asn Glu Gly Glu Asn 
Asn Cys Leu Leu His Pro Met Ser Gin His Gly He Glu Asp Pro Glu 

20 Lys Glu val Leu Glu Trp Arg Phe Asp Ser Lys Leu Ala Phe His His 

Val Ala Arg Glu Leu His Pro Glu "Tyr Tyr Lys Asp Cys (SEQ ID NO: 12) . 

Therefore, this exempUfied Nef protein. Opt tPA-Nef , contains both a tPA leader 
sequence as well as deleting the myristylation site of Gly-2A DNA molecule encoding 
HEV-l Nef from the HIV-l jfrl isolate wherein flie codons are optimized for 
25 expression in a mammalian system such as a human. 

In anoAer specific embodiment of the present invention, a DNA molecule is 
disclosed which encodes optimized HIV-1 Nef wherein the open reading frame of a 
recombinant adenoviral HTV vaccine encodes for modifications at tiie amino terminal 
myristylation site (Gly-2 to Ala-2) and substitution of the Uu-174-Leu-175 dileucine 
30 motif to Ala-174-Ala-175. This open reading frame is herein described as opt nef 
(G2AJLLAA) and is disclosed as SEQ ID N0:13, which comprises an initiating 
methionine residue at nucleotides 12-14 and a 'TAA" stop codon from nucleotides 
660-662. The nucleotide sequence of this codon optimized version of HIV-1 jrfl nef 
gene witii the above mentioned modifications is disclosed herdn as SEQ ID NO: 13. 
35 as follows: 
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GATCTGCCAC 


CATGGCCGGC 


AAGTGGTCCA 


AGAGGTCCGT 


GCCCGGCT66 


TCCACCGT6A 


GGGAGAGGAT 


GAG6AGGGCC 


GAGCCCGCCG 


CCGACAGGGT 


GAGGAGGACC 


GAGCCCGCCG 


CC6T6GGCGT 


GGGCGCCGTG 


TCCAGGGACC 


TGGAGAAGCA 


CGGCGCCATC 


ACCTCCTCCA 


ACACCGCCGC 


CACCAACGCC 


GACTGCGCCT 


GGCTGGAGGC 


CCAGGAGGAC 


GAGGA6GTGG 


GCTTCCCCGT 


GAGGCCCCAG 


GTGCCCCTGA 


GGCCCATGAC 


CTACAAGGGC 


GCCGTGGACC 


TGTCCCACTT 


CCTGAAGGAG 


AAGGGCGGCC 


TGGAGGGCCT 


GATCCACTCC 


CAGAAGAGGC 


AGGACATCCT 


GGACCTGTGG 


GTGTACCACA 


CCCAGGGCTA 


CTTCCCCGAC 


TGGCAGAACT 


ACACCCCCGG 


CCCCGGCATC 


AGGTTCCCCC 


TGACCTTCGG 


CTGGTGCTTC 


AAGCTGGTGC 


CCGTGGAGCC 


CGAGAAGGTG 


GAGGAGGCCA 


ACGAGGGCGA 


GAACAACTGC 


GCCGCCCACC 


CCATGTCCCA 


GCACGGCATC 


GAGGACCCCG 


AGAAG6AGGT 


GCTGGAGTGG 


AGGTTCGACT 


CCAAGCTGGC 


CTTCCACCAC 


GT6GCCAGGG 


AGCTGCACCC 


CGAGTACTAC 


AAGGACTGCT 


AAAGCCCGGG 


C (SEQ ID N0:13) . 









The open reading frame of SEQ ID NO: 13 encodes Nef (G2AXLAA), 
disclosed herein as SEQ ID NO: 14, as follows: 

15 Met Ala Gly Lys Trp Ser Lys Arg Ser Val Pro Gly Trp 
Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Arg 
Thr Glu Pro Ala Ala Val Gly Val Gly Ala Val Ser Arg 
Lys His Gly Ala He Thr Ser Ser Asn Thr Ala Ala Thr 
Cys Ala Trp Leu Glu Ala Gin Glu Asp Glu Glu Val Gly 

20 Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly 
Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly 
Ser Gin Lys Arg Gin Asp He Leu Asp Leu Trp Val Tyr 
Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro 
Phe Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro 

25 Glu Lys Val Glu Glu Ala Asn Glu Gly Glu Asn Asn Cys 
Pro Met Ser Gin His Gly He Glu Asp Pro Glu Lys Glu 
Trp Arg Phe Asp Ser Lys Leu Ala Phe His His Val Ala 
His Pro Glu Tyr Tyr Lys Asp Cys Ser (SEQ ID NO: 14) . 

An additional embodiment of the present invention relates to another DNA 

30 molecule encoding optimized HTV-l Nef wherein the amino terminal myristylation 
site and dileucine motif have been deleted, as well as comprising a tPA leader peptide. 
This DNA molecule, opt tpanef (LLAA) comprises an open reading frame which 
encodes a Nef protein containing a tPA leader sequence fused to amino acid residue 
6-216 of HIV-1 Nef Qfrl), wherein Leu-174 and Leu-175 are substituted with Ala-174 

35 and Ala-175 (Ala-195 and Ala-196 in this tPA-based fusion protein). The nucleotide 



Ser 


Thr 


Val 


Val 


Arg 


Arg 


Asp 


Leu 


Glu 


Asn 


Ala 


Asp 


Phe 


Pro 


Val 


Ala 


Val 


Asp 


Leu 


lie 


His 


His 


Thr 


Gin 


Gly 


He 


Arg 


Val 


Glu 


Pro 


Ala 


Ala 


His 


Val 


Leu 


Glu 


Arg 


Glu 


Leu 
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sequence comprising the open leading frame of opt ^anef (LLAA) is disclosed herein 
as SEQ ID NO:15, as shown below: 

CATGGATGCA ATGAAGAGAG GGCTCTGCTG TGTGCTGCTG CTGTGTGGAG CAGTCTTCGT 
TTCGCCCAGC GAGATCTCCT CCAAGAGGTC CGTGCCCGGC TGGTCCACCG TGAGGGAGAG 
GATGAG6AGG GCCGAGCCCG CCGCCGACAG GGTGAGGAGG ACCGAGCCCG CCGCCGTGGG 
CGTGGGCGCC GTGTCCAGGG ACCTGGAGAA GCACGGCGCC ATCACCTCCT CCAACACCGC 
CGCCACCAAC GCCGACTGCG CCTGGCTGGA GGCCCAGGAG GACGAGGAGG TGGGCTTCCC 
CGTGAGGCCC CAGGTGCCCC TGAGGCCCAT GACCTACAAG GGCGCCGTGG ACCTGTCCCA 
CTTCCTGAAG GAGAAGGGCG GCCTGGAGGG CCTGATCCAC TCCCAGAAGA GGCAGGACAT 
CCTGGACCTG TGGGTGTACC ACACCCAGGG CTACTTCCCC GACTGGCAGA ACTACACCCC 
CGGCCCCGGC ATCAGGTTCC CCCTGACCTT CGGCTGGTGC TTCAAGCTGG TGCCCGTGGA 
GCCCGA6AAG GTGGAGGAGG CCAACGAGGG CGAGAACAAC TGC6CCGCCC ACCCCATGTC 
CCAGCACGGC ATCGAGGACC CCGAGAAGGA GGTGCTGGAG TGGAGGTTCG ACTCCAAGCT 
GGCCTTCCAC CACGTGGCCA GGGAGCTGCA CCCCGAGTAC TACAAGGACT GCTAAAGCCC 
<SEQ ID NO: 15) . 

The open reading frame of SEQ ID NO:7 encoding tPA-Nef (LXAA), 
disclosed herein as SEQ ID N0:16, is as follows: 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
Ala Val Phe Val Ser Pro Ser Glu He Ser Ser Lys Arg Ser Val Pro 
Gly Trp Ser Thr Val Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala 
Asp Arg Val Arg Arg Thr Glu • Pro Ala Ala Val Gly Val Gly Ala Val 
Ser Arg Asp Leu Glu Lys His Gly Ala He Thr Ser Ser Asn Thr Ala 
Ala Thr Asn Ala Asp Cys Ala Trp Leu Glu Ala Gin Glu Asp Glu Glu 
Val Gly Phe Pro Val Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr 
Lys Gly Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu 
Glu Gly Leu He His Ser Gin Lys Arg Gin Asp He Leu Asp Leu Trp 
Val Tyr His Thr Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro 
Gly Pro Gly He Arg Phe Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu 
Val Pro Val Glu Pro Glu Lys Val Glu Glu Ala Asn Glu Gly Glu Asn 
Asn Cys Ala Ala His Pro Met Ser Gin His Gly He Glu Asp Pro Glu 
Lys Glu Val Leu Glu Trp Arg Phe Asp Ser Lys Leu Ala Phe His His 
Val Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys (SEQ ID N0:16) . 
An adenoviral vector of the present invention may comprise a DNA sequence, 
regardless of codon usage, which expresses a wild type or modified Nef protein as 
described herein, including but not limited to modified Nef proteins which comprise a 
deletion or substitution of Gly 2, a deletion of substitution of Leu 174 and Leu 175 
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and/or inclusion of a leader sequence. Therefore, partial or fully codon optimized 
DNA vaccine expression vector constructs are preferred since such constructs should 
result in increased host expression. However, it is within the scope of the present 
invention to utilize "non-codon optimized" versions of the constructs disclosed herein, 

5 especially modified versions of HTV Nef which are shown to promote a substantial 
cellular immune response subsequent to host administration* 

Figure 20A-C show nucleotide sequences at junctions between nef coding 
sequence and plasmid backbone of nef expression vectors VI Jns/nef (Figure 20A), 
VI Jns/nef(G2A,lXAA) (Figure 20B). VI Jns/tpanef ff^igure 20C) and 

10 VI Jns/tpanef(ULAA) (Figure 20C, also). 5' and 3* flanking sequences of codon 

optimized nef or codon optimized nef mutant genes are indicated by bold/italic letters; 
nef and nef mutant coding sequences are indicated by plain letters. Also indicated (as 
underlined) are the restriction endonuclease sites involved in construction of 
respective nef expression vectors, VI Jns/tpanef and VlJns/tpanef(LLAA) have 

IS identical sequences at the junctions. 

Figure 21 shows a schematic presentation of nef and nef derivatives, Amino 
acid residues involved in Nef derivatives are presented. Glycine 2 and Leucine 174 
aiiu 1 75 are the sites involved in myristylation and dileucine motif, respectively. 

20 EXAMPLE 19 

MRKAdSPol Construction and Virus Rescue 
Construction of vector: shuttle plasmid and pre-adenovirus plasmid- Key 
steps perfonned in the construction of the vectors, including the pre-adenovhus 
plasmid denoted MRKAdSpol, is depicted in Figure 22. Briefly, the adenoviral shuttle 

25 vector for the full-length inactivated HIV-1 pol gene is as follows. The vector 

MRKpdelEl(Pac/pIX/pack450)+CMVmin+BGHpA(str.) is a derivative of the shuttle 
vector used in the construction of the MRKAdSgag adenoviral pre-plasmid. The 
vector contains an expression cassette with the hCMV promoter (no intronA) and the 
bovine growth hormone polyadenylation signal. The expression unit has been 

30 inserted into the shuttle vector such that insertion of the gene of choice at a unique 
BglR site will ensure the direction of transcription of the transgene will be Ad5 El 
parallel when inserted into the MRKpAd5(El-/E3+)Clal (or MRKpAdHVE3) pre- 
plasmid. The vector, similar to the original shuttle vector contains the Pacl site, 
extension to the packaging signal region, and extension to the pIX gene. The synthetic 

35 fulUength codon-optimized HIV-1 pol gene was isolated directly from the plasmid 
pVlJns-HIV-pol-inact(opt). Digestion of this plasmid with Bgl n releases the pol 
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gene intact (comprising a codon optimized lA pel sequence as disclosed in SEQ ID 
N0:3). The pel fragment was gel purified and ligated into the 
MRKpdelEl(Pac/prX/pack450)+CMVmin+BGHpA(str.) shuttle vector at the BgCtt 
site. The clones were checloed for the correct orientation of the gene by using 
5 lesiriction enzymes DrdSilNotl . A positive clone was isolated and named 

MRKpdel+hCMVmin+FI^pol+bGHpA(s). The genetic structure of this plasmid was 
verified by PGR, restriction enzyme and DNA sequencing. The pre-adenovirus 
plasmid was constructed as follows. Shuttle plasmid MRKpdel+hCMVmin+FL- 
pol+bGHpA{S) was digested with restriction enzymes Pad and BstlWl I (or its 
10 isoschizomer, BsrZlO? I) and then co-transformed into E. coli strain BJ5183 with 
Uneaiized (Ctol digested) adenoviral backbone plasmid, MRI^AdCEl-/E3+X3al. 
The resulting pre-plasmid ori^nally named MRKpAd+hCMVmin+FL- 
pol+bGHpA(S)E3+ is now referred to as "pMRKAdSpol". The genetic stracture of 
the resulting pMRKAdSpol was verified by PGR, restriction enzyme and DNA 
15 sequence analysis. The vectors were transformed into competent E. coli XL-1 Blue 
for preparative production. The recovered plasmid was verified by restriction enzyme 
-di gestion and DNA sequence analysis, and by expression of the pol transgene in 
transient transfection cell culture. The complete nucleotide sequence of tiiis 
pMRKAdSHTV-lpol adenoviral vector is shown in Figure 26 A-AO. 
20 Generation of research-grade recombinant adenovirus - The pre-adenovirus 

plasmid, pMRKAdSpol, was rescued as infectious virions in PER.G6* adherent 
monolayer cell culture. To rescue infectious virus, 12 |ig of pMRKAdSpol was 
digested with restriction enzyme PocI (New Engjand Biolabs) and 3.3 jig was 
transfected per 6 cm dish of PER.C6® ceUs using the calcium phosphate co- 
25 precipitation technique (Cell Phect Transfection Kit, Amersham Pharmacia Biotech 
Inc.). Pacl digestion releases the viral genome fi^m plasmid sequences allowing viral 
replication to occur after entry into PER.C6®ceUs. Infected ceUs and media were 
harvested 6 -10 days post-transfection, after complete viral cytopathic effect (GPE) 
was observed. Infected cells and media were stored at < -eO^C. This pol containing 
30 recombinant adenovirus is referred to herein as "MRKAdSpol". This recombinant 
adenovirus expresses an inactivated HIV-1 Pol protein as shown in SEQ ID NO:6. 

EXAh4PLE20 
MRKAdSNef Construction and Virus Rescue 
35 Construction of vector: shuttle plasmid and pre-adenovirus plasmid - Key 

steps perfonned in the constraction of the vectors, including the pre-adenovirus 
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plasmid denoted MRKAdSnef, is depicted in Figure 23. Briefly, as shown in 
Example 19 above> the vector 

MRKpdelEl(Pac/pIX/pack450)-fCMVmin+BGHpA(str.) is the shuttle vector used in 
the construction of the MRKAdSgag adenoviral pre-plasmid. It has been modified to 
5 contain the Pacl site, extension to the packaging signal region, and extension to the 
pIX gene. It contains an expression cassette with the hCMV promoter (no intronA) 
and the bovine grov^oh hormone polyadenylation signal. The expression unit has been 
inserted into the shuttle vector such that insertion of the gene of choice at a unique 
Bgll 1 site will ensure the direction of transcription of the transgene will be Ad5 El 

10 parallel when inserted into die MRKpAd5(El-/E3+)Clal pre-plasmid. The synthetic 
fuU-lenglh codon-optimized HTV-l nef gene was isolated directly from the plasmid 
pVl Jns/nef (G2A,LLAA). Digestion of this plasmid with Bglll releases the pol gene 
intact, which comprises the nucleotide sequence as disclosed in SEQ ID N0:13. The 
nef fragment was gel purified and ligated into the 

15 MRKpdelEl+CMVmin+BGHpA(str.) shuttle vector at the Bgll 1 site. The clones 

were checked for correction orientation of the gene by using restriction enzyme Seal. 
A positive clone was isolated and named MRKpdelElhCMVminFL-nefBGHpA(s). 
ane genetic structure of this plasmid was verified by PCR, restriction enzyme and 
DNA sequencing. The pre-adenovirus plasmid was constructed as follows. Shuttle 

20 plasmid MRKpdeIElhCMVminFL-nefBGHpA(s) was digested with restriction 

enzymes Pacl and BstllOl r(or its isoschizomer, BsiZlOl J) and then co-transformed 
into E, coli strain BJ5183 with linearized {Oal digested) adenoviral backbone 
plasmid, MRKpAd(El/E3+)Clal. The resulting pre-plasmid originally named 
MRKpdelElhCMVminEL-nefBGHpA(s) is now referred to as "pMRKAdSnef. The 

25 genetic stmcture of the resulting pMRKAdSnef was verified by PCR, restriction 

enzyme and DNA sequence analysis. The vectors were transformed into competent £. 
coli XLrl Blue for preparative production. The recovered plasmid was verified by 
restriction enzyme digestion and DNA sequence analysis, and by expression of die 
nef transgene in transient transfection cell culture. The complete nucleotide sequence 

30 of this pMRKAd5HIV-lnef adenoviral vector is shown in Figure 27 A-AM. 

Generation of research-grade recombinant adenovirus - The pre-adenovirus 
plasmid, pMRKAdSnef, was rescued as infectious virions in PER,C6® adherent 
monolayer cell culture. To rescue infectious virus, 12 jxg of pMRKAdnef was 
digested with restriction enzyme Pacl (New England Biolabs) and 33 ^g was 

35 transfected per 6 cm dish of PER.C6® cells using the calcium phosphate co- 
precipitation technique (Cell Phect Transfection Kit, Amersham Pharmacia Biotech 
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Inc.). Pacl digestion releases the viral genome from plasmid sequences allowing 
viral repUcation to occur after entry into PER.C6®ceUs. Infected ceUs and media were 
harvested 6 -10 days post-transfection, after complete viral cytopathic effect (CPE) 
was observed. Infected ceUs and media were stored at < -60*0. This nef containing 
recombinant adenovirus is now referred to as "MRKAdSnef 

EXAMPLE 21 

Construcdon of Murine CMV Promote Containing Shuttle Vectors 
for Inactivated Pol and NeffG2A J-LAA 
Hie murine CMV (mCMV) was ampUfied ftom the plasmid pMH4 (supplied 
by Frank Graham. McMaster University) using the primer set: mCMV {Not I) 
Fbrward: 5'-ATA AOA a tOCGGCCGC CA TAT ACT GAG TCA TTA GG-3' 
(SEQ ID NO: 20); mCMV {Bgl II)Reverse: 5'-AAG GAAGATCTA CCG ACQ 
CTG GTC GCG CCT C-3' (SEQ ID N0:21). The underlined nucleotides represent 
the Not I and the Bgl n sites respectively for each primer. This PGR amplicon was 
used for the constroction of the mCMV shutde vector containing the transgene in the 
PI parallel orientation. The hCMV promoter was removed from the original shutUe 
vector (containing the hCMV-gag-bGHpA transgene in the El parallel orientation) by 
digestion witti Not I and Bgl H. The mCMV promoter {Not VBgl H digested PCR 
product) was inserted into the shuttie vector in a directional manner. The shuttle 
vector was then digested with Bgl H and die gag reporter gene (Bgl H fragment) was 
re-inserted back into the shuttle vector. Several cl<Mies were screened for correct 
orientation of die reporter gene. For die construction of die mCMV-gag in the El 
antiparaUel orientation, die mCMV promoter was amplified from die plasmid pMH4 
using die following primer set: mCMV (Arc I) Forward: 5'. ATA AGA ATGfiCG 
gGCC AT ATA CTG AGT CAT TAG G (SEQ ID NO:22); mCMV (Bgl II) Reverse: 
5* AAG GAAGATCTA CCG ACG CTG GTC GCG CCT C (SEQ ID NO:23). The 
underUned nucleotides represent die Asc I and Bgl H sites, respectively for each 
primer. The shuttie vector containing die hCMV-gag transgene in tiie El antiparallel 
orientation was digested witii Ascl and Bglll to remove die hCMV-gag portion of die 
transgene. The mCMV promoter (AscVBglll digested PGR product) was inserted 
into die shuttie vector in a directional manner. The vector was tiien digested widi 
Bgin and die gag reporter gene {Bglll fragment) was re-inserted. Several clones 
were screened for correct orientation of die reporter gene. For each of die full lengtii 
lA pol and full lengdi nef/GZAJIAA genes, cloning was performed using die unique 
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Bgl n site within the mCMV-bGHpA shuttle vector. The pol and nef genes w«e 
excised from their respective pVl Jns plasmids by Bgl U digestion. 

EXAMPLE 22 

5 Construction of mCMV Full Length Inactivated Pol and 

Full Length nef/G2A.LLAA Adenovectors 
Each of these transgenes of Example 21 were inserted into the modified 
shuttle vector in both the El parallel and El anti-parallel orientations. Pad and 
BstZl 101 digestion of each shuttle vector was performed and each specific transgene 
10 fragment containing the flanking Ad5 sequences was isolated and co-transfonned 
with Cla I digested MRKpAd5^+) or MRKpAd5(E3-) aderiovector plasmids via 
bacterial homologous recombination in BJ5183 E. coli cells. Recombinant pie- 
plasmid adenovectors containing the various transgenes in both the E3- andE3+ 
versions (and in the El parallel and El antiparallel orientations) were subsequently 
15 prepared in large scale following transformation into XL-1 Blue E. coli cells and 
analyzed by restriction analysis and sequencing. 

EXAMPLE 23 
Construction of hCMV-tpa-nef (ULAA) Adenovector 

20 The tpa-nef gene was amplified out from GMP grade pVl Jns-tpanef (LLAA) vector 
using the primer sets: Tpanef (BaraHI) F 5*-ATT GGATCC ATG GAT GCA ATG 
AAG AGA GGG (SEQ ID 24); Tpanef (BamHI) R 5^-ATA GGATCC TTA GCA 
GTC CTT GTA GTA CTC G (SEQ ID NO:25). The resulting PCR product was 
digested with So/nHI, gel purified and cloned into the Bgl n site of MRKAdSCMV- 

25 bGHpA shuttle vector (Bgl II digested and calf intestinal phosphatase treated). 

Clones containing the tpanef (LLAA) gene (see SEQ ID NO:15 for complet coding 
region) in the correct orientation with respect to the hCMV promoter were selected 
following Sea I digestion. The resulting MRKAdStpanef shutUe vector was digested 
with Pac I and Bst Zl 101 and cloned into the E3+ MRKAdS adenovector via bacterial 

30 homologous recombination techniques. 

EXAMPLE 24 

Immunogenicity of MRKAdSpol and MRKAdSnef Vaccine 
Materials and Methods - Rodent Immunization - Groups of N=10 BALB/c 
35 mice were immunized i.m. with the following vectors: (1) MRKAdShCMV-IApol 
(E3+) at either 10^7 vp and 10^9 vp; and (2) MRKAdShCMV-IApol (E3-) at either 
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10'^7 vp and 10^9 vp. At 7 weeks post dose, 5 of the 10 mice per cohort were boosted 
with the same vector and dose they initially lecaved. At 3 weeks post the second . 
does, sera and spleens were coDected from all the animals for RT ELISA and IFNg 
EUspot analyses, respectively. For aU rodent immunizations, the Ad5 vectors were 
dUuted in 5 mM Tris, 5% sucrose, 75 mM NaCl, 1 mM MgC12, 0.005% polysorbate 
80. pH 8.0. The total dose was injected to both quadricep muscles in 50 pL aliquots 
using a 0.3.mL insulin syringe with 28-1/2G needles (Becton-Dickinson, Franklin 
Lakes, NJ). 

Groups of N=10 C57/BL6 mice were immunized i.m. with the following 
vectors: (1) MRKAd5hCMV-nef(G2AMAA) 0E3+) at either 10^7 vp and 10^9 vp; 
(2) MRKAd5mCMV-nef(G2A,LLAA) (E3+) at either 10^7 vp and 10^9 vp; and (3) 
MRKAd5mCMV-tpanef(LLAA) (E3+) at either 10^7 vp and 10^ vp. At 7 weeks 
post dose, 5 of the 10 mice per cohort were boosted with the same vector and dose 
they initially recdved. At 3 weeks post the second does, sera and spleens were 
IS collected from all the animals for RT EUSA and IFNg EUspot analyses, 
respectively. 

Non-human Primate immunization - Cohorts of 3 rhesus macaques (2-3 kg) 
were vaccinated with the following Ad vectors: (1) MRKAd5hCMV-IApol (E3+) at 
either 10'^9 vp and lO'^U vp dose; and (2) MRKAdShCMV-IApol (E3-) at rather 

20 10^9 vp and 10^11 vp; (3) MRKAd5hCMV-nef(G2AjLLAA) (E3+) at either 10^9 vp 
and lO'^ll vp; and (4) MRKAd5mCMV-nef(G2AJLLAA) {E3+) at either 10^ vp and 
10^11 vp. The vaccine was administered to chemically restrained monkeys (10 
mg/kg ketamine) by needle injection of two 0.5 mL aUquots of the Ad vectors (in 5 
mM Tris, 5% sucrose, 75 mM NaQ. 1 mM MgQj, 0.005% polysorbate 80, pH 8.0) 

25 into both deltoid muscles. The animals were immunized twice at a 4 week interval 
(T=0,4weeks). 

Murine anti-RT and onti-nefEUSA - Anti-RT titers were obtained following 
standard secondary antibody-based ELISA. Maxisorp plates (NUNC. Rochester, NY) 
were coated by overnight incubation with 100 mL of IjAg /mL HIV-1 RT protein 

30 (Advanced Biotechnologies, Columbia, MD) in PBS. For anti-nef EUSA. 100 uL of 1 
ug/mL HIV-1 nef (Advanced Biotechnologies, Columbia, MD) was used to coat the 
plates. The plates were washed with PBS/0.05% Tween 20 using TitertekMAP 
instrument (Hunstville. AL) and incubated for 2 h with 200 jdJwell of blocking 
solution (PBS/0.05% tween/1% BSA). An initial serum dilution of 100-fold was 

35 performed foUowed by 4-fold serial dilution. lOO-jiL aliquots of serially diluted 
samples were added per well and incubated for 2 h at room temperature. The plates 
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were washed and 100 jiL of 1/1000-diluted HRP-rabbit anti-mouse IgG (ZYMED, 
San Francisco, CA) were added with 1 h incubation. The plates were washed 
thoroughly and soaked with 100 jiL 1,2-phenylenediannne dihydrochloride/hydrogen 
peroxide PAKO, Norway) solution for 15 min. The reaction was quenched by 
5 adding 100 pL of 0.5M H2SO4 per well. OD492 readings were recorded using Titertek 
Multiskan MCC/340 with S20 stacker. Endpoint titers were defined as the highest 
serum dilution that resulted in an absorbance value of greater than or equal to 0. 1 
OD492 (2.5 times the background value). 

Non-human primate and murine EUspot assays - The enzyme-linked 

10 immuno-spot (ELISpot) assay was utilized to enumerate antigen-specific INPy- 
secreting cells from mouse spleens (Miyahira, et al.l995, /. Immunol Methods 
181:45-54) or macaque PBMCs. Mouse spleens were pooled from 5 mice/cohort and 
single cell suspensions were prepared at SxloVmL in complete RPMI media 
(RPMI1640, 10% FBS, 2mM L-glutamine. lOOU/mL Penicillin, 100 u/mL 

15 streptomycin, 10 mM Hepes, 50 uM P-ME). Rhesus PBMCs were prepared from 8- 
15 mL of heparinized blood following standard Ficoll gradient separation (Coligan, et 
al, 1998, Current Protocols in Immunology. John Wiley & Sons, Jnc). Multiscreen 
.^dque plates (MUlipore, France) were coated with 100 jiL/well of either 5 jig/mL 
purified rat anti-mouse IFN-y IgGl, clone R4-6A2 (Phaimingen, San Diego, CA), or 

20 15 ug/mL mouse anti-human IFN-y IgGia (Cat No. 1598-00, R&D Systems, 
Minneapolis, MN) in PBS at 4°C overnight for murine or monkey assays, 
respectively. The plates were washed with PBS/penicillin/streptomycmand blocked 
with 200 pL/well of complete RPMI media for 37 °C for at least 2 h. 

To each well, 50 jiL of ceU samples (4-5x10^ cells per well) and 50 fxL of the 

25 antigen solution were added. To tiie control well. 50 jiL of the media containing 

DMSO were added; for specific responses, either selected peptides or peptide pools (4 
ug^mL per peptide final concentration) were added. For B ALB/c mice immunized 
with the pol constructs, stimulation was conducted using a pool of CIM^-epitope 
containing 20-mcr peptides (aa21-40, aa411-430, aa641-660, aa731-750, aa771-790) 

30 or a pool of CD8+-epitope containing peptides (aa201-220, aa311-330. aa781-800). 
For C57/BL6 mice immunized with the nef construct, either aa51-70 (CDS* T cell 
epitope) or aa81-100 (CD4^ peptide derived from the nef sequence was added for 
specific stimulation. In monkeys, the responses against pol were evaluated using two 
pools (L and R) of 20-aa peptides that encompass the entire pol sequence and overlap 

35 by 10 amino acids. In monkeys vaccinated with the nef constructs, a single pool 
containing 20-mer peptides covering the entire MV-l nef sequence and overlapping 
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by 10 aa was used. Each sample/antigen mixture was pcrfomed in tripUcate wells for 
murine samples or in duplicate weUs for rhesus PBMCs. Plates were incubated at 
37»C. 5% CO2. 90% humidity for 20-24 h. The plates were washed with PBS/0.05% 
Twe^ 20 and incubated with 100 ML'well of either 1.25 ^g/^lL biotin-conjugated rat 
5 anti-mouse IFN-y mAb. clone XMG1.2 (Pharmingen) or of 0.1 ug/mL biotinylated 
anti-human IFN-gamma goat polyclonal antibody (R&D Systems) at 4°C overnight. 
•n,e plates were washed and incubated with 100 filVwell 1/2500 dilution of 
strepavidin-alkaline phosphatase conjugate (Pharmingen) in PBS/0.005% Tween/5% 
FBS for 30 min at 37 "C. Spots were developed by incubating with 100 jjUwell 1- 
10 step NBT/BCIP (Pierce Chemicals) for 6-10 min. The plates were washed with water 
and allowed to air dry. The number of spots in each weU was determined using a 
dissecting microscope and the data normaUzed to lO' cell input. 

Non-human Primate anH-RTEUSA - The pol-specific antibodies in the 
monkeys were measured in a competitive RT EIA assay, wherein sample activity is 
15 determined by the ability to block RT antigen from binding to coating antibody on the 
plate well. Briefly. Maxisorp plates were coated with saturating amounts of pol 
T-.^Hve human serum (#97111234). 250 uL of each sample is incubated witfi 15 uL 
of 266 ng/mL RT recombinant protein (in RCM 563. 1% BSA. 0.1% tween. 0.1% 
NaNs) and 20 uL of lysis buffer (Coulter p24 antigen assay kit) for 15 min at room 
20 temperamre. Similar mixtures are prepared using serially diluted samples of a 

standard and a negative control which defines maximum RT binding. 200iiL/well of 
each sample and standard were added to die washed plate and the plate incubated 16- 
24 h at room temperatiire. Bound RT is quantified following the procedures described 
in Coulter p24 assay kit and reported in milliMerck units per. mL arijitrarily defined 

25 by the chosen standard. 

Results - RodeiU Studies - B ALB/c mice (n=5 mice/cohort) were immunized 
once or twice with varying doses of MRKAd5hCMV.IApol(E3+) and 
MRKAd5hCMV-IApol(E3-). At 3 weeks after the second dose. Anti-pol IgG levels 
were determined by an BUS A assay using RT as a surrogate antigen. Cellular 

30 response were quantified via IFNy EUSpot assay against pools of pol-epitope 

containing peptides. The results of these assays are summarizedin Table 10. The . 
results indicate that the mouse vaccinees exhibited detectable anti-RT IgGs with an 
adenovector dose as low as 10^7 vp. The humoral responses are highly dose- 
dependent and are boostable with a second immunization. One or two doses of either 

35 pol vectors eUcit high frequencies of antigen-spedfic CD4* and CDS* T ceUs; the 
responses are weakly dose-dependent but are boostable witii a second irmnunization. 



87 



wo 02/022080 



PCT/USOl/28861 



Table 


10. Immunogenicir 


/of MI 


IKAdS 


pol Vectors in BALB/c mice. 


Group 


Vacdne 


0O88 


No. of 
Doses 


Antt-RT IgG Titers* 


SFaiD^G cells* 


GUT 


+SE 


•SE 


Medium 


CD4* 
p6ptkSo 

DOOl 


peptide 

DOOl 


1 


MFU(AdahCMVFL|»i {E3t) 


10^7 vp 


2 
1 


310410 
819 


301785 
372 


163020 
285 


1(1) 
1(1) 


75(4) 
72(0) 


2313(87) 
533(41) 


2 


MRKAdShCMVFLpol (B^f 


10^ vp 


2 
1 


ie3840(f 
7131S5 


0 

528520 


0 

303565 


2(2) 
1(1) 


114(0) 
48(7) 


2083(182} 
733(89) 


3 


MRKAdShCMVapol (63-) 


10^7 vp 


2 
1 


310419 
6400 


388218 
14013 


172097 
4393 


0(0) 
10(8) 


223(7) 
141(21) 


2607(27) 
409(28) 


A 

5 


MHKAdShCMVFLpo) (E3-) 

Naive ' 


none 


2 
1 

none 


ie3B40Cf 
1241675* 

57 


0 

398725 
9 


0 

300861 
7 


1(^> 
0(0) 

fU9\ 


160(13) 
39(13) 


2385(11) 
633(83) 



» a " ^- — — w^, wwHMw aiiwi wi wic ymieuic mean 

''Near or at the upper limit of the serial dilution; hence, could be greater than ttita vahxa 

"No. of Spot-fofinhB Cells per mMon eplecnoytes; mean vahias of Iripllcaiss ara rBpartad along WWi standard errora In paianthasit. 

C57/BL6 mice were immunized once or twice with varying doses of 
MRKAd5hCMV-nef(G2A,LLAA) (E3+), MRKAd5mCMV-nef(G2AJXAA) (E3+) 
at either lO'^? vp and(3) MRKAd5mCMV-tpanef(LLAA) (E3+) at either 10^7 vp and 
10^9 vp. The immune response were analyzed using similar protocols and the results 
are listed in Table 11. While anti-nef IgG responses could not be detected in this 
model system with any of the constructs, there are strong indications of a cellular 
inmimiity generated against nef using the ELIspot assay. 



Table 11. Immunogenicity of MRKAdSnef Vectors in C57/BL6 mice 



Group 


VaccbiB 


Dose 


No. of 

Doses 


At 


t)-nef IgG ntera' 


SFC/10*6 cella' 


am 


4-SE 




Medium 


aa5V70 


C044 


1 


MRKAdShCMVFUief (E3^) 


10^7 vp 


2 
1 


174 
132 


70 
42 


SO 
32 


1(1) 
0(0) 


23(1) 

m 


1(1) 
0(0) 


2 


MRKAdShCMVFUiaf (E3f ) 


io«evp 


2 
1 


174 
132 


70 
42 


50 
32 


0(0) 

1(1) 


61(7) 
62(7) 


4(2) 
3(1) 


3 


MRKAd5mGMVFLiMl (E3f ) 


10^7 vp 


2 
1 


132 

lis 


42 
46 


32 
33 


3(1) 
3(2) 


15(5) 
3(2) 


5(2) 
4(2) 


4 


MRKAdSmCMVFl/iel (ESf ) 


10^ vp 


2 
1 


132 
132 


42 
42 


32 
32 


4(2) 
2(1) 


83(13) 
29(2) 


8(1) 
4(0) 


5 


MRKAd5mCMVlpanel(E34)~ 


10^7 vp 


2 
1 


132 
100 


42 
0 


32 
0 


3(2) 
3(1) 


14(2) 
13(4) 


8(1) 
10(3) 


6 
7 


MRKAd5mCMVIpanaf(E3f) 
NiRve 


10^ vp 

none 


2 
1 


230 

lis 

152 


170 
46 

78 


98 
33 


3(2) 
7(1) 


146(29) 
151(14) 

LiaaJ 


4(0) 
10(0) 


•GMT.flB 
Wofap 


omeiric mean tiler of the cohort o 
o^fornihg oalts permODon sptocn 


5 mica; SE, 
oyta8;mean 


standard er 
valinsof tri 


ror ot the gem 
pUcates are net 


9lrio mean 
MJ/tedahmg wl 


52 . 
iihalandarden 


orsflfiparen» 


nata. 


26(3) 



Monkey Studies - Cohorts of 3 rhesus macaques were immunized with 2 doses 
of MRKAd5hCMV-IApol(E3+) and MRKAd5hCMV-IApol(E3-). The number of 
antigen-specific T cells (per million PBMCs) were enumerated using one of two 



88 



wo 02/022080 



PCT/USOl/28861 



peptide pools CL and R) that cover the entire pol sequence; the results are Usted in 
Table 12. Moderate-to-strongT cell responses were detected in the vaccinees using 
either constructs even at a low dose of 10'^9 vp. Longitudinal analyses of the anti-RT 
antibody titers in the animals suggest that the pol transgene product is expressed 
efficiently to eUdt a humoral response (Table 13). It would appear that generaUy 
higher immune responses were observed in animals that received the E3- construct 
compared to the E3+ virus. 

Table 12. Pol-specific T CeU Responses in MRKAdSpol Immunized Rhesus 



Macaques. 


Memk# 




T=4 


To7 




Ta18 




Vaccine fr«s0,4 wKsi 
MRKAe6hCM^-lApd(^3+) 


99aoo 

99C215 
99D201 


1 
1 

5 


PoiL 
0 
2 
5 


PoIR 
0 
2 
4 


Mock 
1 

10 
6 


PolL 
3B 
B8 
149 


PoIR 
31 
249 
85 


KAock 
0 
1 
0 


PotL 
52 
109 
40 


PoIR 
146 
305 
39 


MOCK 

0 ' 

22 
0 


POl L 

49 
68 
35 

6 


PoIR 
716 
250 
18 

6 




Q9D212 
99D180 
99C2D1 


0 
0 
8 


2 
4 

S 


0 
2 
21 


4 
0 

e 


331 
19 
02 


114 
1B2 
82 


0 
4 
0 


58 
38 
18 


14 

153 

32 


0 

5 
1 


36 
14 


108 
66 

40 


lONIVp 


99D239 
990166 
99C084 


S 
4 
1 


2 
12 
8 


2 
6 
9 


20 
5 
8 


82 
120 
84 


172 
421 
464 


1 

2 
0 


66 
271 
14 


114 

489 
238 


9 

\6 
1 


21 
875 
24 


530 
264 

176 


1 fJrfuo 


0C7C 
CD1G 
CDll 

0B3Q 


10 
2 
6 

nd 


10 
0 

6 

1 nd 


B 
1 

12 
nd 


12 
5 
10 

nd 


724 
474 
BB 

nd 


745 
468 
110 

nd 


4 

0 

5 

4 


232 
60 

2 


376 
212 
80 

2 


4 

0 
8 


168 
W1 
25 

1 


121 
34 

2 



ToWp. 1 ^. Anti-RT Ig Levels in MRKAdSpol Immunized macaques. 



R1 ANTIBODY ASSAYTITERS INmMUAinl 



Vocd ne^onkey T eg 



MPkAdShCMV-iApft»E3*^. lO^n VP 



99C100_ 



99C21S 
99D201 



MPK AdShCt^-'^'fga^V 10^9 VP 



99D212 



99D180_ 



99C201 



MPIf Ad5hCMV-«ApQlff3.1 10*11 VP 



99D239 



99C186 



99CD84 



MPICAd5hCMV-'Apnl(E3.y 10*9VP 



CDIG 



61 



81 



53 



T=7 



1999 



1541 



336 



10 



<10 



<10 



44 



21 



235 



CPU 



32 



20 



40 



36 



37 



460 
233 



2637 



175 
140 



16 



112 



T=12 



5928 



2356 



639 



49 



79 



71 



1=16 



4768 



2767 



387 



68 



93 



76 



1234 
480 



2858 



306 
273 



149 



1015 



345 
1626 



235 
419 



237 
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When rhesus macaques were umnunized i.m. with two doses of MRKAdSnef 
constructs, vigorous T cell responses ranging from 100 to as high as 1100 per millioi 
were observed in 8 of 12 vaccinees (Table 14). The efficacies of the mCMV- and 
hCMV- driven nef constracts are comparable on the basis of the data generated thus 
far. 



Table 14. Nef-specific T ceD Responses in »«RKAd5nef Immunized Rhesus 

Macaques. 



Vaccine (Tc0,4 wks) 

MRKAd6hCMV-nef(Q2A.LLAA) (E34-) 
10^11 vp 


Monk# 


P 

Mock 


re 

Nef 


T 

Mock 


b4 

Nel 


T 

Mock 


=7 

Nef 


Tc 
Mock 


>16 

Nof 


CD2D 
CC7B 
CC61 


0 
0 

2 


4 

0 
8 


31 
2 
31 


440 
521 
112 


4 
0 
0 


368 
178 
108 


1 
1 

11 


251 
1522 
100 


MRKAd5hCMV-nef(Q2A.LLAA) (E3f) 
10«9vp 


CC2K 
C016 
CD16 


9 
6 

6 


9 
4 
1 


6 
30 
6 


£2 
998 
1148 


0 
2 
0 


35 
586 
369 


0 
0 

1 


16 
434 
212 


MRKAdSmCMVHnef (Q2A,LLAA) (ESf ) 
10^11 VP 


990191 
99D144 
990193 


1 
4 
1 


5 
6 
2 


4 

5 
1 


614 
434 
58 


0 
0 
1 


298 
1100 
22 


2 
2 
0 


419 
932 
64 


MRKAd5mCMV-nef(G2A,LLAA) (E3^) 
1(K*0vp 

1 NalVe 


990224 
990250 
99C120 

083Q 


1 
8 
1 

nd 


11 
9 
6 

nd 


14 
4 
20 

18 


231 
106 
299 

22 


1 
0 
0 

4 


125 
54 
92 

5 


0 
0 
0 

2 


70 
5 
79 

1 



EXAMPLE 25 

Comparison of Clade B vs. Clade C T Cell Responses in HIV-infected Subjects 
PBMC samples collected from two dozens of patients infected with HIV-1 in 
US were tested in ELISPOT assays with peptide pools of 20-mer peptides overlapping 
by 10 amino acids. Four different peptide pools were tested for cross-clade 
recognition, and they were either derived from a clade B-based isolate (gag H-b; nef- 
b) or a clade C-based isolate (gag H-c, nef-c). Data in Table 15 shows that T ceUs 
from these patients presumably infected with clade B fflV-l could recognize clade C 
gag and nef antigens in ELISPOT assay. Correlation analysis further demonstrated 
that these T ceU responses against clade C gag peptide pool were about 60% of the 
clade B counterpart (Figure 24), whUe the T cell responses against dade C nef were 
about 85% of the clade B counterpart (Figure 25). These results suggest that cellular 
immune responses generated in patients infected with clade B HIV-1 can recognize 
gag and nef antigens derived from clade C HIV-1. These data show that a HFV 
vaccine, such as a DNA ox MRKAdS-based adenoviral vaccine expressing a clade B 
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gag and/or nef antigen will potentially have the ability to provide a prophylactic 
and/or therapetic advantage on a global scale. 



Table 15 

Responses Shown as the Number of gEPN-Secreting T Cells per Million 
PBMCs ' , 



subiect 


bleed date 1 gaa epitope « | mock 


gag H-b 


gagH-c 


ne<-b 


nef-c 




( 


from mapping' 


1 










#100 


19-JUI-99 


12 


10 


3950 


1385 


1295 


1300 


«101 


25-JUI-99 


3 


15 


3885 


1280 


no 


1020 


#102 


25-JUI-99 


4 


15 


1740 


850 


1255 


1785 


#104 


7-Jun-99 


2 


5 


1355 


1185 


no 


1060 


«107 


n-Oct-99 


2 


25 


3305 


2795 


670 


870 


#405 


ll-Jul-99 


2 


15 


4575 


3180 


1700 


1500 


8S01 


19-JUI-99 


2 


15 


1100 


570 


3365 


3460 


#505 


18-JUI-99 


5 


10 


2145 


1725 


1235 


na 


«S06 


28-Feb-99 


2 


25 


150 


45 


400 


610 


«701 


28-MQr-99 


5 


30 


7620 


4775 


3320 


2780 


«709 


l7-May-99 


3 


15 


2785 


1945 


1090 


1630 


«710 


24-May-99 


4 


5 


1055 


1080 


2210 


2140 

















EXAMPLE 26 

Characterization and Production of MRKAdSpol and MRKAdSnef 
Vectors in Roller Bottles 



Expansion of nef aid pal Adenovectors - Nef and pol CsCl purified MRKAdS 
seeds were used to infect rolte botties to produce P4 virus to be used as a seed for 
further experiments. P4 MRKAdS pol and nef vectors were use<} to infect roller 
botties at an MOI 280 vp/cell, except for hCMV-tpa-nef [E3+] which was infected at 
an MOI of 125 due to low titers of seed obtained at P4. 



Table 16 Viral particle concentrations for P5 nef and pol adenovectors 



Adenovector 


AEX Titer 
(10"*vp/iiil culture) 


AEX Titer 
(10*vp/cell) 


Amplification 
Ratio 


hCMV-FL-nef [E3+] 


1.1 


0.9 


30 


mCMV-FL-nef [E3+] 


2.2 


2.1 


75 


hCMV-tpa-nef [E3+] 


0.07 


0.1 


5 


mCMV-tpa-nef [E3+] 


1.3 


0.9 


35 


hCMV-FL-pol [E3+] 


2.7 


2.1 


75 


hCMV-FL-pol IE3-] 


1.9 


1.3 


45 



91 
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10 



Roller Bottle Passaging - Passaging of the pol and ne/constructs continued 
thiough passage seven. CeU-associated (freeze/thaw lysis) and whole broth (txiton^ 
lysis) titers obtained in all passages were very consistent In general, MRKAdSpol is 
ca. 70% as productive as MRKAd5gag while MRKAdSnef is ca. 25% as productive 
as MRKAdSgag. Samples of P7 virus for both constructs were analyzed by V&CB by 
restriction digest analysis and did not show any rearrangements. 



Table 17. Passage Six Viral Productivity for MRKAdSpol and MRKAdSnef 



taCMV-FLniBfDB34-l pool 



Xviabic (lOTccUsftnl), 
VU|]ility(%} 

Harvest 



1.22,85% 



0.99.62% 
MO. 72% 



Ceil Passage 
Number 



62 



ABXHter 
(Cdl Associated) 
10*" vp/m! caimre 



0.8 



•ntcr 



0-7 



Ainpti6cstiOD 
Ratio 



25 



TrilQo Lysis Titer 
10" vp^ml culture 



1.6 



hCMV.FL^[B3H 



pool 
T 



1.4^ 89% 



62 



4J5 



32 



115 



7.0 



1.22,70% 
1.42.74% 





XviBb]e(U 
Viabil 
IbfectiQo 


l^oeUsAnl). 

ty(%) 

Harvest 


Cell Passage 
Number 


A£X Titer 
(CeO Assodated) 
10*"vp/inl culture 


.Titer 
lO^vp^ccfl 


Ampliflcation 
Rado 


"nitos Lysis Titer 
lO^vpAnlcuInirc 


hCMV-FUicf[B3f] Ftx>l 


133,90% 




66 


1.0 


as 


29 


2.1 


1 
2 




0il6^70% 
1.18,73% . 














0.90^.90% 




56 


4.2 


4.7 


168 


6.S 


1 
2 




1.18, 88% 
1j04.80% 













20 



25 



MRKAdSiief and MRKAdSpol Viral Production Kinetics - A limecoursc 
experiment was carried out in roller bottles to determine if the viral production 
kinetics of the MRKAdSpol and MRKAdSnef vectors were similar to those of 
MRKAdSgag. PER.C6® cells in roller bottle cultures were infected at an MOI of 280 
vp/ceDs with PS MRKAdSpol, PS MRKAdSnef and P7 MRKAdSgag; for each 
adenovector, two infected botUes were sampled at 24, 36, 48, and 60 horns post 
infection. In addition, two bottles were left unsampled until 48 hpi when they were 
harvested under the Phase I process conditions. The anion-exchange HPLC viral 
particle concentrations of the fieeze-thaw recovered cell associated virus at the 24, 36, 
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48, and 60 hpi timepoints are shown in Figure 29A-B. The QPA titers show a sixnflar 
trend (data not shown). 

Canparison ofhCMV- and mCMV-FL-nef- As the titers obtained with the 
5 MRKAdSnef construct (hCMV-FL-nef) were lower than those obtained with 
MRKAd5gag or MRXAdSpol, a viral productivity comparison experiment was 
performed widi mCMV-FL-ncf . For each of the two adenovcctors (hCMV- and 
mCMV-FL-neO, two rofler bottles were infected at an MOI of 280 vp/cell with 
passage five clarified lysate. The macroscopic and microscopic observations of the 
10 four roUer bottles were identical at the time of harvest. Analysis of the clarified lysate 
produced indicated a higher viral particle concentration in the bottles infected with 
mCMV-Fl^nef , as shown in Table 19. It is stipulated that the higher productivity with 
mCMV promoter driven nef vector is due to lower nef expression levels in PER.C6® 
cells- experiments are underway at V&CB to measure nef expression levels. 



Table 19. Passage Six Viral Productivity Comparison of hCMV- and mCMV-FL-nef 







Xv (IC cens/Bri). ViabiUty (%) 




AEXUlci 


Titer 


Amplification 


Trttoo Lysis Titer 1 






InfectiOD 


Haivtat 


Number 


10" vp/ml culture 


10*vp/ceU 


Rado 


lO*" vp/ml cttlturt 


hCMV-PL-Dcf 


Pool 


1.11.91% 




60 


\5 


1.4 


30 


2.8 


(MRKAdSnef) 


1 
2 




134,74% 












mCMV-H^nef 


Pool 


1.11,91% 




60 


23 


2.1 


75 


4.6 


1 

2 




U9.84% 
I. IS, 77% 













2Q EXAMPLE 27 

Characterization and Large Scale Production of 
MRKAdSnef Viius in BioieacUHS 
Materials and Methods - The expoimcnt of the present example was nni twice 
under the following conditions: 36.5t, DO 30%. pH 7.30, ISOrpm agitation rate. 
25 no sparging, life Technologies (Gibco, Invitrogen) 293 SFM H (with 6mM L- 

glutamine), 0.5M NaOH as base for pH control. During the first nin (B20010115), 
two lOL stirred vessel bioieactors were inoculated with PER.C6* cells at a 
concentration of 0.2x10* ceUsAnl. Cells were grown until they reached a ceU 
concentration of appioxiniately 1x10* cclls/ml. The cells were infected with undoned 
30 MRKAdSnef (G2A JJLAA) at a MOI of 280 virus particles (vp)/cell. For the second 
batch (B20010202). Ite same inocedure as the first run was used, except the cells 
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were infected with cloned MRAdSnef. During both nins, the bioieactors were 
harvested 48 hours post-infection. Samples were taken and virus concentrations were 
determined from whole broth (with triton lysis), supernatant, and cell pellets (3 X 
freeze/thaw) with the AEX and QPA assays. Metabolites were measured with 
BioProfile 250 throughout the process. 



Table 20: Experimenta 


Conditions 


Temperature 


36.5 


DO 


30% 


PH 


730 


AKitation 


150 ipm 


Spareing 


None 


Table 21: Virus source used for experiments. 



Run 


Batch ID 


Cloned/Uncloned 


MOl 






MRKAd5nef 


(vp/cells) 


#1 


320010115-1 


Uncloned 


280 




B20010115-2 


Uncioned 


280 


#2 


B20010202-1 


Cloned 


280 




B20010202-2 


Closed 


280 



Results - Table 22 and 23 show an the ability to scale up production of 
MRKAdSnef by growth in a bioreactor. 

Table 22: Virus Concentration as measured by the AEX assay 



Run 


Batch ID 


Cloned/Uncloned 
MRKAdSnef 


Vims Concentration ® 48hpi (IxlO" vp/L) 


Supernatant 


Clarified Lysatc 


Total 


Triton Lysate 


#1 


B20010115-1 


Uncloned 


0.72 


3.26 


3.98 


5,76 


B20010115-2 


Uncloned 


0.38 


1.67 


2.05 


2.46 


#2 


B2001Q202-1 


Ooned 


0.80 


6.00 


6.80 


8.88 


B2001Q202-2 


Cloned 


0.50 


6.00 


6.50 


8,47 



Table 23: Virus Titers as measured by the QPA assay 



Run 


Batch ID 


GonedAJncloned 
MRKAdSnef 


Virus Concentration ® 48hpi (IxlO" lU/L) 


Whole 
Broth 


Supernatant 


Clarified 
Lysate 


Total 


Triton 
Lysate 




B20010115-1 


Uncloned 


0.13 


1.12 


1.76 


2.88 


11^8 


B20010115-2 


Uncloned 


0.14 


0.73 


1.54 


2.27 


5.86 


#2 


B20010202-1 


Goned 


0.14 


0.97 


1.62 


2.69 


11.89 


820010202-2 


Closed 


0.14 


1.17 


1,70 


2,97 


12,47 



The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in 
addidon to those described herein will become apparent to those skilled in the art 
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from the foregoing description. Such modifications are intended to f aU within the 
scope of the appended claims. 

EXAMPIJE28 
MRKAdSHIV-lgag Boosting of DNA-Primed Animals 

Groups of 3-5 rhesus macaques were immunized with (a) 5 mgs of VlJns- 
Hgag (pVUnsCMV(no intron)-FL-gag-bGHpA), (b) 5 mgs of VlJns-Flgag 
formulated with 45 mgs of a non-ionic block copolymer CRL1005, or (c) 5 mgs of 
VlJns-Hgag formulated with 7.5 mgs of CRL1005 and 0.6 mM benzalkonium 
chloride at weeks 0, 4, and 8. All animals recdved a single dose of 10e7 viral 
particles (vp)of the MRKAd5HIV-lgag at week26. Nol«: 10e7 is too low to prime 
or boost effectively when used as a single modality (dose is selected to mimic 
I«eexposuie to adenovirus); see Figure 32. 
15 Blood samples were collected from all animals at several time points and 

peripheral blood mononuclear cells (PBMCs) were prepared using standard Ficoll 
ir-hod. The PBMCs were counted and analyzed for gamma-interferon secretion 
using the EUSpot assay (Table 24). For each monkey, tiie PBMCs were incubated 
overnight either in the absence (medium) or presence of a pool (called "gag H") of 50 
20 20-aa long peptides that encompass the entire HIV-1 gag sequence. 

The results indicate that MRKAd5HIV-lgag was very effective in boosting 
the T ceU immune responses in these monkeys. At.week 28 or 2 weeks after the viral 
boost, tiie number of gag-specific T cells per million PBMCs increased 2-48 fold 
compared to tiie levels observed at week 24 or 2 weeks prior to the boost 
25 The PBMCs were also analyzed by intracellular gamma-interferon staining 

prior to (at week 10) and after die MRKAdSgag boost (at week 30). The results for 
select animals are shown on Figure 31. The results indicate that (a) immunization 
with DNA/adjuvant formulation elicited T cell responses which can eitiier be 
balanced. CD4*-biased or CD8*-biased, and (b) boosting with the MRKAdSgag 
30 construct produced in all cases a strongly CD8*-biased response. These results 

suggest that boosting with MRKAd5HIV-lgag construct is able to improve the levels 
of antigen-specific CDS* T cells. 
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EXAMPLE 29 

Construction of gagpol fusion for MRKAdSgagpol fusion constructs 

The open reading frames for the codon-optiniized HIV-1 gag gene was fused 
5 directly to the open reading frame of the lA pol gene (consisting of RT, RNAseH and 
integrase domains) by stepwise PGR. Because the gene (SEQ ID NO: 38) does not 
include the protease gene and the frameshift sequence, it encodes a single polypeptide 
of the combined size of p55. RT. RNAse H and integrase (1350 amino acids; SEQ ID 
NO: 39). 

10 The fragment that extends from the BstEH site within the gag gene to the last 

non-stop codon was Ugated via PGR to a fragment that extends from the start codon 
of the lApol to a unique BamH/ site. This fragment was digested with BstE// and 
BamH/. Gonstruction of gag-IApol fusion was achieved via three-fragment Ugation 
involving the Pst/-BstEf7 gag digestion fragment, the BstEZZ/BamHZ digested PGR 

15 product and long Pstf/BamHZ VlR-FLpol backbone fragment. 

The MRKAdS-gagpol adenovirus vector was constructed using the BgUT 
fragment of the VlR-gagpol containing the entire ORF of gag-IApol fusion gene. 

EXAMPLE 30 

20 Immunogenicity Studies in Non-Human Primates 

Gohorts of three (3) macaques were inmiunized with lOeS or lOelO viral 
particles (vp) of one of the following MRKAdS HIV-1 vaccines: (1) MRKAdSgag; 
(2) MRKAdSpol; (3) MRKAdSnef ; (4) a mixture containing equal amounts of 
25 MRKAdSgag, MRKAdSpol. and MRKAdSnef. or (5) a mixture of equal amounts of 
MRKAdSgagpol and MRKAdSnef. Hie vaccines were administered at weeks 0 and 
4. 

The T cell responses against each of the HIV-1 antigens were assayed by IFN- 
gamma EUSpot assay using pools of 20-aa peptides that encompass the entire protein 

30 sequence of each antigen. Hie results (Table 25) are expressed as the number of spot- 
forming cells (sfc) per million peripheral blood mononuclear cells (PBMC) that 
respond to each of the peptide pools. 

Results indicate the following observations: (1) each of the single gene 
constructs (MRKAdSgag. MRKAdSpol. or MRKAdSnef) is able to eUcit high levels 

35 of antigen-specific T cells in monkeys; (2) the single-gene MRKAdS constnicts can 
be mixed as a multi-cocktail f ormulaUon capable of eUciting very broad T ceU 
responses against gag, pol. and nef ; (3) the MRKAdS vector expressing the fusion 

97 



wo 02/022080 



PCT/USOl/28861 



protein of gag plus lA pol is capable of inducing strong T cell responses to both gag 
and pol. 



Table 25- Evaluation of Mixtures of MRKAdS vectors expressing humanized 



Grp # 


Vaccine 


Monk# 


Te6 wks 




T=0, 4 wks 




Mock 


GagH 


PoI-1 


Pol -2 


Nef 


1 




wD9V 


0 


15 


- 


- 


- 




10^10 vp 


CD19 


0 * 


374 


• 


- 


- 








1 


643 








2 


MRKAdS aaa 




1 


948 


- 








10^ vp 




16 


324 


- 


■ 


- 






143H 


4 


595 








3 


MRKAd5 pol 


V^O 1 A 


4 


- 


46 


256 


_ 




IC^IO vp 




3 


• 


463 


550 


- 






AV43 


6 




95 


1333 




4 


MRKAdS pol 


AW38 


1 


- 


19 


30 


- 




10^ vp 




0 


* 


50 


995 


- 






CC21 


1 




33 


• 436 




5 


MRKAdS nef 


076Q 


9 


- 


- 


- 


1204 




10^10 vp 


091 Q 


4 








85 






083Q 


0 


- 


- 


- 


176 


6 


MRKAdS nef 


00C029 


1 








114 




IQASvp 


980022 


6 








170 






96D160 


3 








198 


7 


MRKAdSgag-i-MRKAdSpol+MRKAdSnef 


9SD251 


3 


206 


15 


193 


120 




10^10 vp each 


05H 


3 


135 


21 


9 


638 






00C016 


3 


26 


4 


51 


23 


6 


MRKAdSgag+MRKAdSpol+MRKAdSnef 


99D215 


1 


171 


18 


193 


240 




10^ vp each 


61 H 


5 


73 


6 


14 


243 






12H 


8 


1140 


116 


811 


719 


9 


MRKAdSgagpol -t-MRKAdS nef 


99D211 


0 


83 


56 


838 


725 




10^10 vp each 


22H 


4 


385 


119 


1194 


1915 






61H 


4 


343 


11 


765 


853 


10 


MRKAdSgagpol +MRKAd5 nef 


34H 


3 


78 


19 


5 


75 




10^ vp each 


48H 


1 


65 


105 


46 


43 






70H 


5 


158 


15 


220 


191 



gag 



10 



H. fifty 20-aa peptides encompassing p55 sequence; poM, 20-aa peptides repiesenting N-tenninal half of lA pol; 
pol-2, 20-aa peptides representing the carboxy-tenninal half of lA pol; nef, 20-aa peptides encompassing die entire 
wild-type nef sequence. Responses to the antigens prior to die fixst immunization did not exceed 40 sfc/lO^ 
PBMC. 
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WHAT IS CLAIMED IS 

1 . A recombinant adenoviral vaccine vector at least partially deleted in 
5 El and devoid of El activity, comprising: 

a) an adenovirus cw-acting packaging region corresponding to from 
about base pair 1 to between from about base pair 400 to about 
base pair 458 of a wildtype adenovirus genome; and 

b) a gene encoding an HIV protein or immunologically relevant 
10 modification thereof- 

2. A vector in accordance with claim 1 comprising a packaging region 
coiresponding to from about base pair 1 to about base pair 450 of a wildtype 
hucnuvirus genome. 

3. A vector in accordance with claim 1 further comprising nucleotides 
15 corresponding to between from about base pair 3511 to about 3524 to about base pair 

5798 of a wildtype adenovirus genome. 

4. A vector in accordance with claim 3 comprising base pairs 
corresponding to 1-450 and 3511-5798 of a wildtype adenovirus genome. 

5. A vector in accordance with claim 4 which is deleted of base pairs 

20 451-3510. 

6. A vector in accordance witii claim 1 which is at least partially 

deleted in E3. 

7. A vector in accordance with claim 6 wherein the E3 deleted region 
is from base pairs 28,133-30,818. 
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8. A vector in accordance with claim 1 wherein the gene encoding the 
HIV protein or modification thereof comprises codons optimized for expression in a 
human. 

9. A vector in accordance with claim 1 wherein the vector comprises a 
gene expression cassette comprising: 

a) a nucleic acid encoding a protein; 

b) a heterologous promoter operatively linked to the nucleic acid 
encoding the protein; and 

(c) a transcription termination sequence. 

10. A vector in accordance with claim 9 wherein the gene expression 
cassette is inserted into the El region. 

11. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette is in an El paraDel orientation 

12. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette is in an El antiparallel orientation. 

13. An adenoviral vector in accordance witii claim 9 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 

14. An adenoviral vector in accordance with claim 13 wherein the 
promoter is an inmiediate early human cytomegalovinis promoter. 

15. An adenoviral vector in accordance with claim 9 wherein the 
promoter is a murine cytomegalovirus promoter. 

16. An adenoviral vector in accordance with claim 9 wherein the 
transcription termination sequence is a bovine growth honnone polyadenylation and 
transcription termination sequence. 
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17. An adenoviral vector in accordance with claim 9 wherein the 
transcription termination sequence is a synthetic polyadenylation signal (SPA). 

18. A cell comprising the adenoviral vector of claim 1. 

19. Recombinant, replication-defective adenovirus particles harvested 
5 and purified subsequent to transf ection of the adenoviral vector of claim 1 into a cell 

line which expresses adenovirus El protein at complementing levels. 

20. An HTV vaccine composition comprising purified adenovirus 
particles of claim 19. 

21. An HIV vaccine composition of claim 20 which comprises a 
10 physiologically acceptable carrier, 

22. A method of producing recombinant, replication defective 
.isnovirus particles containing the adenoviral genome of the adenoviral vector of 
claim 1 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, and harvesting the resultant recombinant, 

15 replication-defective adenovirus. 

23. A method according to claim 22 wherein the cell is a PER.C6® 

ceU. 

24. A method of generating a cellular-mediated immune response 
against HIV in an individual comprising administering to the individual a vaccine of 

20 claim 21. 

25. A method according to claim 24 which further comprises 
administration to the individual a DNA plasmid vaccine, optionally administered with 
a biologically effective adjuvant, protein or other agent capable of increasing the 
immune response. 
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26. A method according to claim 25 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovims 
vaccine. 

27. A method according to claim 24 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 

28. A method according to claim 24 which comprises administering 
and readministering the adenovirus vaccine vector to the individual. 

29. An adenoviral vector in accordance with claim 1 wherein the HIV 
protein is HIV gag or an immunologically relevant modification thereof. 

30. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette comprises an open reading frame encoding an HIV gag protein or 
•'nmunologically rdevant modification thoceof. 

31. A recombinant adenoviral vaccine vector at least partially deleted 
in EI and devoid of El activity, comprising: 

a) an adenovirus w-acting packaging region corresponding to from 
about base pair 1 to about base pair 450 and from about 3511 to about 
5798 of a wildtype adenovirus genome, and deleted for base pairs 
corresponding to from about base pair 451 to from about base pair 
3510 of a wildtype adenovirus genonae; and 

b) a gene expression cassette comprising 

i) SEQIDNO:29; 

ii) a heterologous promoter operatively linked to i); and 

iii) a transcription termination sequence. 
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32. An adenoviral vector in accordance with claim 31 wherein die 
gene expression cassette is in an El parallel orientation. 

33 An adenoviral vector in accordance v^ith claim 31 wherein the gene 
expression cassette is in an El antiparallel orientation. 

34. An adenoviral vector in accordance with claim 31 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences, 

35. An adenoviral vector in accordance with claim 31 wherein the 
transcripdon termination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 

36. An adenoviral vector in accordance with claim 31 which is at least 
partially deleted in E3. 

37. A cell comprising the adenoviral vector of claim 30. 

38. Recombinant, replication-defective adenovirus particles harvested 
and purified subsequent to transfection of the adenoviral vector of claim 30 into a cell 
line which expresses adenovirus El protein at complementing levels. 

39. An HIV vaccine composition comprising purified adenovirus 
particles of claim 38. 

40. An HIV vaccine composition of claim 39 which comprises a 
physiologically acceptable carrier. 

41 . A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 
claim 30 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, and harvesting the resultant recombinant, 
jeplication*defective adenovirus. 
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42. A method according to claim 41 wherein the cell is a PER.C6* 

ceU. 

43. A method of generating a cellular-mediated immune response 
against HIV in an individual comprising administering to the individual a vaccine of 
claim 21. 

44. A method according to claim 43 which further comprises 
administration to the individual aDNA plasmid vaccine, optionaDy administered with 
a biologically effective adjuvant, protein or other agent capable of increasing die 
immune response. 

45. A method according to claim 44 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovirus 
vaccine. 

46. A method according to claim 43 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 

47. A mefliod according to claim 43 which comprises administering 
and readministering the adenovirus vaccine vector to the individual. 

48. An adenoviral vector in accordance with claim 1 wherein the HIV 
protein is HIV pol or an immunologically relevant modification thereof. 

49. An adenoviral vector in accordance vnth claim 9 wherein the gene 
expression cassette comprises an open reading frame encoding an HIV pol protein or 
immunologicaily relevant modification thereof. 

50. A recombinant adenoviral vaccine vector at least partially deleted 
in El and devoid of El activity, comprising: 
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a) an adenovirus cij-acting packaging zegion corresponding to from 
about base pair 1 to about base pair 450 and from about 3511 to about 
5798 of a wildtype adenovirus genome, and deleted for base pairs 
conesponding to from about base pair 451 to from about base pair 
3510 of a wildtype adenovirus genome; and 

b) a gene expression cassette comprising 

i) a nucleotide sequence selected the group consisting of 
SEQ ID NO: 1, SEQ ID NO: 5 and SEQ ID NO: 7; 

ii) a heterologous promoter operatively linked to i); and 

iii) a transcription termination sequence. 

51. An adenoviral vector in accordance with claim 50 wherein the 
gene expression cassette is in an El parallel orientation. 

52. An adenoviral vector in accordance with claim 50 wherein the 
gene expression cassette is in an El antiparallel orientation. 

53. An adenoviral vector in accordance with claim 50 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 

54. An adenoviral vector in accordance with claim 50 wherein the 
transcription termination sequence is a bovine growth honnone polyadenylation and 
transcription termination sequence. 

55. An adenoviral vector in accordance with claim 50 which is at least 
partially deleted in E3. 

56. A cell comprising the adenoviral vector of claim 49, 
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57. Recombinant, replication-defective adenovirus particles har^ 
and purified subsequent to transfection of the adenoviral vector of claim 49 into a cell 
line which expresses adenovirus El protein at complementing levels. 

58. An HIV vaccine composition comprising purified adenovirus 
particles of claim 57. 

59. An HIV vaccine composition of claim 58 which comprises a 
physiologically acceptable carrier. 

60. A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 
claim 49 which comprises introducing the adenoviral vector into a host ceU which 
expresses adenoviral El protein, and harvesting the resultant recombinant, 

plication-defective adenovirus. 

61. A method according to claim 60 wherein the cell is a PER.C6® 

cell. 

62. A method of generating a ceUular-mediated immune response 
against HIV in an individual comprising administering to tiie individual a vaccine of 
claim 59. 

63. A method according to claim 62 which further comprises 
administration to tiie individual a DNA plasmid vaccine, optionaDy administered witii 
a biologically effective adjuvant, protein or other agent capable of increasing tiie 
immune response. 

64. A metiiod according to claim 63 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovirus 
vaccine. 

106 



wo 02/022080 



PCT/USO 1/28861 



65. A method according to claim 62 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 

66- A method according to claim 62 which comprises administering 
andreadministering the adenovirus vaccine vector to the individual. 

67. An adenoviral vector in accordance with claim 1 wherein the HIV 
protein is HIV nef or an immunologically relevant modification thereof, 

68. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette comprises an open reading frame encoding an HTV nef protem or 
immunologically relevant modification thereof. 

69. A recombinant adenoviral vaccine vector at least partially deleted 
in El and devoid of El activity, comprising: 

a) an adenovirus cw-acting packaging region corresponding to from 
about base pair 1 to about base pair 450 and from about 3511 to about 
5798 of a wildtype adenovirus genome, and deleted for base pairs 
corresponding to from about base pair 451 to from about base pair 
3510 of a wildtype adenovirus genome; and 

b) a gene expression cassette comprising 

i) a nucleotide sequence selected the group consisting of 
SEQ ID NO: 9, SEQ ID NO: 11, SEQID NO: 13 and 
SEQIDNO: 15; 

ii) a heterologous promoter operatively linked to i); and 

iii) a transcription termination sequence. 

70. An adenoviral vector in accordance with claim 69 wherein the 
gene expression cassette is in an El parallel orientation. 
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71. An adenoviral vector in accordance with claim 69 wherein the 
gene expression cassette is in an El antiparallel orientation. 

72. An adenoviral vector in accordance with claim 69 wherein the 
prontioter is a cytomegalovirus promoter devoid of intronic sequences. 

73. An adenoviral vector in accordance with claim 69 wherein the 
transcription termination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 

74. An adenoviral vector in accordance with claim 69 which is at least 
partially deleted in E3. 

75. A cell comprising the adenoviral vector of claim 68. 

76. Recombinant, repUcation-defective adenovirus particles harvested 
purified subsequent to transfection of the adenoviral vector of claim 68 into a cell 

line which expresses adenovirus El protein at complementing levels. 

77. An HIV vaccine composition comprising purified adenovirus 
pardcles of claim 76. 

78. An HTV vaccine composition of claim 77 which comprises a 
physiologically acceptable carrier. 

79. A mediod of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 
claim 68 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, arid harvesting the resultant recombinant, 
replication-defective adenovirus. 

80. A method according to claim 79 wherein the cell is a PER.C6® 

cell. 
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81. A method of generating a ceUular-mediated immune 

against HIV in an individual comprising administering to the individual a vaccine of 
claim 78. 

82. A method according to claim 81 which further comprises 

5 administration to the individual a DNA plasmid vaccine, optionally administered with 
a biologically effective adjuvant, protein or other agent capable of increasing the 
immune response. 

83. A method according to claim 82 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovirus 

10 vaccine. 

84. A method according to claim 81 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 

85. A method according to claim 81 which comprises administering 
and readministering the adenovirus vaccine vector to the individual. 

15 86. A multivalent adenovirus vaccine composition comprising 

recombinant, replication-defective adenovirus particles, wherein the adenovirus 
particles are harvested and purified from a cell line expressing adenovirus El t)rotein, 
and wherein the particles are harvested subsequent to transfection of the cells with an 
adenoviral vector or vectors in accordance with claim 9; said vector(s) comprising a 

20 gene expression cassette or cassettes comprising nucleotide sequences encoding HTV 
proteins selected from the group consisting of: 

a) gag, pol, and nef, expressed independently from three individual 

vectors; 
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b) gag, pol, and nef, expressed independently ficom one vector with 
the encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 

c) gag, pol, and nef, expressed via two vectors, one expressing a pol- 
nef fusion, and another expressing gag; 

d) gag, pol, and nef, expressed via two vectors, one expressing a gag- 
pol fusion and another expressing nef; 

e) gag, pol and nef, expressed via two vectors, one expressing a nef- 
gag fusion and another expressing pol; 

0 gag, pol, and nef, expressed via one vector expressing a gag-pol- 
nef fusion; 

g) gag and pol, expressed independently from two individual vectors; 

h) gag and pol, expressed independently from one vector with the 
encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 

i) pol and nef, expressed independently from two individual vectors; 
j) pol and nef, expressed independently from one vector with the 

encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 

k) nef and gag, expressed independently from two individual vectors; 

1) nef and gag; expressed independently from one vector with the 
encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 

m) gag and pol, expressed via one vector expressing a gag-pol fusion; 
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n) pol and nef, expressed via one vector expressing a pol-nef fusion; 
and 

o) nef and gag, expressed via one vector expressing a nef-gag fusion. 

87. A multivalent adenovirus vaccine composition in accordance with 
claim 86 wherein the gag-pol fusion consists of SEQ ID NO: 39. 

88. A multivalent adenovirus vaccine composition in accordance with 
claim 86 wherein the fused sequences have the encoding nucleic acid sequences 
operatively linlced to distinct promoters and transcription termination sequences. 

89. A multivalent adenovirus vaccine composition in accordance with 
claim 86 wherein the fused sequences have the encoding nucleic acid sequences 
operatively linked to a single promoter, and the encoding nucleic acid sequences 
u^iadvely linked by an internal ribosome entry sequence ("IRES")- 
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Orierna/ Adenovector Construct: 




Figure 1; Original HIV-1 gag adenovector. 
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R«.r ,i.rnee of the oDfen readin n frame for Fl-oan fhuman COdon optimised) 



al! 



caaaaagaagiaMaBdaaagcacattglgtgggartccagggagciggagaggWgctgtgaac^ 
dgctggagacctrtgagggglgcaggcagatatgggcMgctccagccctccctgcaaacaggctc^gg 



acai 



MtattctrtQc«rtglclgaggglgcMa«cccaggacclgaacaK»1gclgaacacagigggggg(^^^ 
MQC^gcJtgcagatgdgaaggagaccatcaatgaggaggdgctgagtgggacaggcl^^^ 

Sg^catSc^^ggSiBal^^^^ 

SQaaQcaQatiggclgoRigaocaa(aacoco««atcc«gigggggaaatttacaagaggtggai^^^ 
wtggg«Agaawagattgtgaggatgiadcccc«aKrtcM^ 



gi?iaaatctggccc1wcacaagggcaggcctggcaacttc^^^ 



agciglaccccctg9c«iccclgaggtccctgWggcaacgacccctcclcc«agtaaaataaag««gggca 

gat (SEQrDNO:29) 
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Old Transgene: 



New Transgenes: 




Figure 3: Diagrammatic representation of ihe oriQlnal HIV-l gag transgene and the series o! 
new iransgene constructions. 
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Figure 4: Modifications made to the current adenovector backbone in the generation of the new 
vector. 
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Expt P^ *— TT 



EJ E3- 



(E3 vsE3*) 



£) 
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Figure 5: Virus mixing experiments lo determine the effects of the addition made to the packaging signal 
region (Ex pi l>1 ) end anai>'st$ of the etfe^E ol the E3 gene on viral growth (Estpt. «2). Ttte led t>ars denote 
the region of modHicalions made to the El deletion. 
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Figure 6: Auloradiograph of viral DNA analysis following viral mixing experiments (expts. tl 
and «2) as detailed in the text. 

\ 



6/144 



wo 02/022080 



PCT/USOl/28861 




Figure 7A: hCMV-Figag-bGHpA acienovectors constructed within the "MRK" bacUxme. El 
paraW and £1 antiparatlel transoer>e orientation wKhin the E3- and E3^ backborves were 
construciBd. 
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Rgure 7B: hCMV-FLgae-SP A adenovectors constructed within the "MRK" backbone. El 
psraiiel end E1 entiparallel ifansgene orientation wtthm the E&- and E3^ backbones were 
constructed. 
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E3' 



E3* 



Figure 7C: mCMV*FLga^bGHpA aG)enovector$ constnicted wnhin the *MRK* bacKt>one. £i 
parallel and El Bntiparaliel transgene orientation within the E3- and EZ^ backbones were 
constructed 
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Plasmid mixing expt: (orientation) 




Figure BA: Eflect of tran^gene orientation 
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Piasmid Mixing expt: (poiy a signal) 
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Figure 8B: Effect of potyadenylatton signal 
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Rgure 9: Viral DNA ffom the four Adgag candidates at PS. following BstB^ 1 digestion. 
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Figure 10: Viral DNA anelyste of passage 11 and/or 12 of MBKHVE3, MRKAdSflag end 
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Figure 11: Viral DNA anal/sis {HMW digestion) of passage 6 MRKHVE3 and MRKAdSgag used 
to infiiate the vira' competition study, last two lanes are passage 1 1 analysis of duplicate 
passages of the competition study (each virus at MO) 280 vp). 
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Figure 12: Viral DNA analysis by H/ndllt digestion on high passage numbers lor MRKAdSgag In 
serum coniaining media with collections nrvade at specified times. The first lane shows the 1 Kb 
DNA size marKer. The other lanes represent pra-plasmid control (digested with Pad and HindlU). 
and MRKAdSgag virus continually passaged to P16. PIG and P21 (serum containing madia). 
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Figure . Strom ami*p24 Levels at 3 XMcspost i.in. immunization of balb'c mice (d*=]0) 
wiib Vaning Doses of SeveraJ Adgag constnicis: (A) MRKAdSgag (through passage 5): 
(B) MRKAdS E3- hCMV-FLgag-bGHpA; (C) MRK.Ad5 IT hCMV-FLgag-SPA; (D) 
MRKAd5 mCMN'-FLgag-bGHpA; (D) research Loi (293 cell-derived) of AdSHIV- 
Igag; aj)d (F) chnica) lot (Ad5gagFN0001 ) of AdSHIV-lgag. Reponed are ihe geometric 
mcaii liters (GMT) for each cohon. 
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FIGURE 16 
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AGAlC]ACCATGCCCCaATCTCCaCATlGAG/CTGTCaTGTGAAGC 

Bain MetAloProlleSerProlleCluThrVolProVolLysLeulysProGlyMelAspGlyProLysVolLy 
t 10 20 

GCAGTGGCOCCTGAClGAGCAC/^AT(mCCClGCTGCIJ^TCTGCAClGAGATa»^ 
sGlnTrpProLeuThrCiuGlu^ys]leLysAtoLeijVolClulleCysThrCliMelGitiLysCii<;iyLysIleSerL 
30 40 50 

ACATlG<X!CCa;AfiAACCCCTACAACADCCCTGTGniGCCATCAAGAAGAAGGACTCCAa 

ys 1 1 eG I yProG luAsnProTyrAsnThrProVo IPheAl ol I cLysLysLysAspSerlhrLysTrpArgLysleuVo I 

GACTTCAGGGACCTCAACAAGAGCACCCAGGACTTCmAGGTGCAGClCGGCAlCCCCCACCCCGCTGCCCTGW^ 
AspPheArgGluLeuAsnLysArglhrGlnAspPheTrpGluVolGinLei^lylleProHisProAloGlyLeuLysly 
80 90 100 

CAAGAAC1CTCTCAC1C1CC1CGCTC1GC6GGA7CCCTACTTCTCTGTGCCCC1GGA1GAGGACTTCADCAAGTACAC1G 
slysLysSerVonhrVolLeuAl oVolGlyAspAIoTyrPhcSerVo IProLeuAspG I uAspPheArgLysTyrThrA 
110 "~ 120 130 

CCTTCACCATCCCCTCCAICAACAATGACACCCCTGGCATCAGGTACCAGTACAAICTGCTGCCCCAGGGCTGGWGGGC 
loPheTnrlleProSerlleAsnAsnGluThrProClylleArgTyrGlnlyrAsnVolLeuProGlnGlyTrplysGly 

140 150 

ICOXIGCCATCTICCACICCICCAIGACCAAGATCCTCGAGCCCTTCAGGAACCAGAACCCIGACAITCTGATCTACCA 
SerProAloI lePheGlnSerSerMeUhrLys 1 leLeiC luProPheArgLysG InAsnProAspl leVol 1 leTyrC I 
160 170 180 

CTACA7GGCTCCCC1C7ATGTGGGCTCTGACCTGGAGAT1GGGCAGCACAGGACCAAGATTGAGGAGCTCAGGCAGCACC 
nlyrMelAloAloUuTyrVolGlySerAspLeuGlulleGlyGlnHisArgThrLyslleGljClulJuArgGlnHisL 
190 200 210 

TGClGACGlGGGGCC1CACCA[»CClGACAAGMGCACCACAA0GACCCC0CCTTa^^^ 
edeuArQTrpGlyLeuThrThifroAsplysLy^isGliaysGluPrfProPheLeuTr|AtelGlyTyrClid.euHis 

220 230 

Ca£ACAAG1GGADlGlDCAGCCCAl1GTCCTGCn(UCAAGGAC1CC10^^^ 
ProAsplyslrplhrVolCinProIieVoiletf'roGlulysAspSerlrp'IhrVolAsnAspIleGlnlysLeuVolGI 
240 250 260 

CAAGCTGACTGGGIXTCCCAAAlCTACCCTGGCATCAACCTGACCCAGCTGTGCAAGClGCTGAGGOGCAa 
yLysleuAsnlrpAloSerGlnllelyrProGlylleLysVolArgGlnleuCysLysLeuLeuArgKlyThrLysMo^ 



FIGURE 17A 
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lCACTCACGlCATC(XCC7CACTGftGCADGClGACCT|»aiCGClGAGAACA0(»AGA^ 
EiiThrGluVolIleProLeulhrCluGluAlflGluleuGluleuAloBluAsnArgClulleLeuLysGluProVolHis 

300 

CGGCTGTACUTGACCCClCCAAGGACCTGATTCCTGttAlCCAGAAGCAGGGCCAGGGa^ 
ClyVolTyrTyrAspProSerLysAspLeulleAloGlulleGlnLysClnGlyClnGIyGlnTrplhrTyrGlnlleTy 

320 340 

CMXADCCCTICMSAACCTGAAGACTCGCAAGTATCCCAGGATGAGGGGGGCCCACAIXAATGATGIGA 
rGlnGluProPbeLysAsnLeulysThrGlyLysTyrAloArgMelArgGlyAloHisThrAsnAspVdllysClnLeu^ 
350 360 i'O 

Cr.AGCCTGTCCAGAAGATCACCACTGACTCCATTGT6AlCTGCGGCAAGACCCC(^TTC^^ 
hrGluAloVolGlnLysllelhrThrGluSerlleVolIleTrpGIyLysThrProlysPhcLysLeuProlleGlnLys 

380 390 

GAGACCTGGGAGACClGGTGGACTGAGlACTGGCAGGCCACCTGGAlCaiGAGTCGGACTnGTGAACAC^ 
GlulhrTrpGluThrlrpTrpThrCluTyrTrpGlnAloThrTrplleProGluTrpCluPheVolAsnThrProProLe 
400 410 420 

GGlCAAGClClGGlACCACClGGAGAAGGAGCC»^T1GlGGGGGClGM;ACCTlCTATG1GGClGGGGCTGa:AAC^ 
uVolLysLeulrpTyrGlnLeuGluLysGliiProlleVolClyAloCluthrPheTyrVoIAIoGlyAloAloAsnArg^ 
430 440 4?0 

AGACCAAGCIGGGCAACGCTGGCUIGTGACCAACACGGCCAGGCAGAACGICGTGACCCTGACTGACACCACCAACCM^ 
lulhrLysLeuGlyLysAloGlyTyrVolTbrAsnArgClyArgGlnLysVolVolThrLeuThrAspThrThrAsnGIn 

460 470 

AAGACTGCCCTO^GCCATClACCIGGCCCTCCAGGACTCTGGCCTGGAaTGAACAnGIGACTGCCJC^^ 
LvsThrAloLciClnAloIlelyrLeuAloLejClnAspSerGlyLeuGluVolAsnlleVolThrAiiSerClnTyrAI 

' 480 490 MO 

CCIGGGCAICATCCAGGCCCAGCCIGATCACTCTCACICTGACCIGCTCAACW^ 
oLeiiGlyl lei leGlnAI oGlnProAspGlnSerGlilSerGluLwVolAsnGlnl iel leGliClnLeul lelysLysG 
5J0 520 530 

WIAACGTGTACClCGCCTGGGlGCCTGCCCACAACGCCATTGGGGGCAAIGABWa^ 
lulysVolTyrLeuAloTrpVolProAloHisLysGlylleGljfGlyAsnGliCCnVolAspLysLeuVolSerAloGly 

540 550 

AlCAGGAACGTrcTCTTCCTCCATGGCATTCACAAGCCCCACGATCAGaTGAGAACIA^^ 
JUAroLysVolLeuPheLeuAspGlylleAspLysAljClnAspGluHisCluLyslyrHisSertenTrpArgAI*^ 

5B0 570 5B0 

FIGURE 17B 



44/144 



wo 02/022080 



PCT/USOl/28861 



GWiClCTGACTTCAACCTCCCrCCTGTaSlGCCTAAGGACATTGlCaClCClCTGADWJTGCCAGCTGW^^ 
tAloSerAspPheAsnLeuProProVoIVolAloLysGtuMeVolAloSerCysAspLysCysGlnLeuLysGlvGluA 
590 600 610 

CCATGCATGGGCAGGTGGACTCCmTCCCATCTGCCAGCTGGCCTGCACCCACClGGAGGG^ 
laMelHisClyClnVoIAspCysSerProGlyl leTrpGlnLeuAIoCysThrHisLeuGluGlyLysVol I leLeuVol 

620 ~ 630 

GCTGTGCAlGTGGCCTCCOJCTACATrGAGGCTGAGGTGATCCCTGCTGAGACAGGCCAGGACACTGCCTAC^ 
AloVolHisVolAloSerGlyTyrlleGluAIaCluVollleProAloGluThrClyClnGluThrAloTyrPheLeuLe 
6*0 650 660 

GAAGCIGGCTGGCACGTGGCCTGTGAAGACCATCCACACTGCCAATGGCTCCAACTICACTGGGGCCACAGTGAGG^^ 
uLysleuAliClyArgTrpProVolLysThrlleHlsThrAloAsnGlySerAsnPheThrGlyAloThrVolArQAloA 
670 — 680 690 

CCTGCTCGTCGCCTGCCATCAACCAGGAGlTlGGCAlCCtCTACAACCCCCAGlCCCACCGCCTGC 
loCysTrpTrpAloGlylleLysGlnGluPheGlylleProTyrAsnProClnSerGhGlyVclVoIAIoSerMelAsn 

700 710 ~ 

AACCAGCTCAAGAACA7CATTGGGCAGCiGAGCGACCACCCTGAGCACCTGAAGACAGC7C7GCAGA7GGCTC7GTTCAT 
lysGluLeuLysLysJIelleGlyGlnVolArgAspGlnAloCluHisLeuLysIhrAloVoIGInMelAloVolPhell 
720 730 740 

CCACAACTtCAAGAGGAAGGGGCGCATOGCGGCCTACTCCGCTGGGGAGAGGATTGTGGACATCATlCCCACAG^^^ 
eHisAsnPheLysArglysGlyC lyl leGlyGlyTyrSerAIeGlyGluArgl leVolAspI lei IcAloThrAspl leG 
750 760 770 

ACACCAAOGAGCTCCAGAAGCACATCACCAAGATCCAGAACTTCAOXITCTACTACAGGGACTCCAGGAACC^ 
InlhrLysC I uLeuG inLysG In I leThrLysl leG I nAsnPhcArgVo I Tyr Tyr ArgAspSer ArgAsnProLeuTrp 

780 790 

AACGGCCCTCttAAGCTGClCTGGAAGGCGGACGGGGCTGTCGlCATCCAGGACAACTCTGACATCA^ 
LysGiyProAlolysLeuleuTrpLysGlyGliClyAloVoWoUleGlnAspAsnSerAsplleLysVolVolProAr 
600 BID £20 

GAGGAACGCCAACATa7(%GGAClAT(mADCAGATimBGaATGAnGlCl^ 
gArglysAloLysIlelleArgAspTyrCljrlysClnMelAleClyAspAspCysYolAloSerArtflnAspGluAspx 
£20 640 850 

AAAGCCCGGGCAGATCV (SEQIDN0:3) 

HGURE 17C 
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vr 

OPT 

wr 

OPT 

wr' 

OPT 

wr 

OPT 

WT 
OPT 

WT . 
OPT 

WT 
OPT 

WT 
OPT 

vr 

OPT 

WT 
OPT 



ATG GST GGC AA6 TGG TCA AAA C6T AGT GT6 CCT GGA TG6 TCT -42 

III II 111 III III II II 1 III II II III II 
ATG GGC GGC AAQ TGG TCC AAG AGG TCC GTG CCC GGC TGG TCC 
HGGKWSKRSVP6WS -14 

AQ GTA AGg' gAA AGA ATG AG^ CGA GAG CCA GCA GCA GAT -84 

II II III II 11 III II I II III II II II II 
ACC GTG AGG GAG AGG ATG AGG AGG GCC GAG CCC GCC GCC GAC 

TVRERNRRAEPAAD -28 

AGG GTG AGA CGA ACT GAG CCA GCA GCA GTA GGG GTG GGA GCA >126 

III III II t II III II II II II II III II II 
AGG GTG AGG AGG ACC GAG CCC GCC GCC GTG GGC GTG GGC GCC 

RVRRTEPAAVGVGA -42 

GTA TCT CGA GAC CTG GAA AAA CAT GGA GCA ATC ACA AGT AGO -168 

II II I III III II II II II II III 11 I 
GTG TCC AGG GAC CTG GAG AAG CAC GGC GCC ATC ACC TCC TCC 
VSRDLEKHGAITSS -86 

AAT ACA GCA GCT ACC AAT GCT GAT TGT GCC TGG CTA GAA GCA -210 

II II II II III II II II II III III II II II 
AAC ACC GCC GCC ACC AAC GCC GAC TGC GCC TGG CTG GAG GCC 
NTAATNADCAWLEA -70. 

CAA GAG GAT GAG GAA GTG GGT TTT CCA GTC AGA CCT CAG GTA -252 

II III II III II III II II II II II II III II 
CAG GAG GAC GAG GAG GTG GGC TTC CCC GTG AGG CCC CAG GTG 
QEDEEV 6FPVRPQV -84 

CCT TTA AGA CCA ATG ACT TAC AAG GGA GCT GTA GAT CTT AGC -294 

II I II II III II III III II II II 11 II I 
CCC CTG AGG CCC ATG ACC TAC AAG GGC GCC GTG GAC CTG TCC 

PLRPMTYKGAVDLS -98 

CAC TH TTA AAA GAA AAG GGG GGA CTG GAA GGG aA ATT CAC -336 

III II 1 II II III il II III II II II II III 
CAC TTC CTG AAG GAG AAG GGC GGC CTG GAG GGC CTG ATC CAC 

NFLKEKGGLEGLIH -112 

TCA CAG AAA AGA CAA GAT ATC Cn GAT CTG TGG GTC TAC CAC -378 

II III II II II II III II II III III II III III 
TCC CAG AAG AS6 CAG GAC ATC CTG GAC CTG TGS GTG TAC CAC 
SQKRQOILDLMV.YH -126 

ACA CAA GGC TAC TTC CCT GAT TGG CAG AAC TAC ACA CCA GGG -420 

II II III III III II II III III III III II II II 
ACC CAG GGC TAC TTC CCC GAC TGG CAG AAC TAC ACC CCC GGC 
T06YFPDW0NYTPG .140 



FIGURE 19A 
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tfr . CCA 6SA ATC AGA -m CCA TTG ACC TTT GBA TGS TGC TTC AAG -462 

It II III II II II II III II II III III III III 
OPT - CCC GGC ATC AGG TTC CCC CTG ACC TTC 6GC TG6 TGC TTC AAG 

p 6 I R F P L T F 6 W C F K .154 

UT - CTA GTA CCA (HT BAG CCA BAA AAG GTA GAA GAS BCC AAT 6AA -504 

II II II II III II II III 11 11 111 III II 11 
OPT - CTG CTG CCC GTG SAB CCC GAG AAG GTG GAB GAG GCC AAC BAG 

LVPVEPEKVEEANE -168 

in . G6A GAG AAC AAC TGC TTG TTA CAC CCT ATG AGC CAG CAT GGG -546 

II ill III III III II I III II III i HI II II 
OPT . GGC GAG MC AAC TGC CTG PG CAC CCC ATG TCC CAG CAC GGC 

6ENNCLLHPHSQHG -182 

. ATA GAG 6AC CCG GAG AAG GWk GTG TTA BAG TGG AGG TTT GAC -588 

II III III II III 111 II III 1 I III 111 111 I I II 
DPT - ATC GAG GAC CCC GAG AAG GAG GTG CTG GAG TGG AGG TTC GAC 

I E D P E K E V L E W R F D -196 

in- ■ . AGC AAG CTA GCA TH CAT CAC GTG GCC CGA GAG CTG CAT CCB -630 

I 111 II II II II III III III I III 111 II II 
OPT . TCC AAG CTG GCC TTC CAC CAC GTG GCC AGG GAG CTG CAC CCC 

SKLAFHHVARELHP -210 

- GAG TAC TAC AAG GAC TGC TSA (SEQ ID N0:30) -651 

DPT - BAG TAC TAC AAG GAC TGC TAA (contained within SEQIDNO:*) 

E Y Y K D C (SEQtoNO:10) -216 

FIGURE 19B 
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HIV^PoNnact (opt) 



MRKpdel 



>ONpA(sfr.) 




KanR 



6gll1/phosphatB8e 



BqII 1(i$Dlate pol gene fragment) 




Ligation 

Screen for correct orientation of pol 



HlV-Pd-inact (opt) 




Cial digest pre-pla$mid 



Digest with Pad and Bst1107 I (or Bst2l07 t) 



Bacterial Homologous Recombination (BJSi63) 
XL-1 blue (plasmid amplification} 



HlV^Pol'inact (opt) 



Figure 1: Construction of pre-plasmid ipMRKAdSpor 
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Bg) 11 /phosphatase Bgtl 1 (isolate nef gene fragment) 




B» iienu)S9j 
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Figure 2: Construction of pre-plasmid *pMRKAd5ner. 
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MRKAd5polMER1062 
(MRKAdS Pre-AOenovira] Vecior Containing the lA opt pel Cooing Region) 



1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 
eTAGTAGTTA TTATATGGAA TAAAACCTAA CTTCGGTTAT ACTATTACTC 

51 GGG6TGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 
CCCCACCTCA AACACTOCAC CGCGCCCCGC ACCCTTQCCC CGCCCACTGC 

101 TAGTAGTGTG GCGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 
ATCATCACAC CGCCTTCACA CTACAACGTT CACACCGCCT TGTGTACATT 

151 GCGACGGATG TGGCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 
CGCTGCCTAC ACCGTTTTCA CTGCAAAAAC CACACGCGGC CACATGTGTC 

201 GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGG6 
CTTCACTGTT AAAAGCGCGC CAAAATCCGC CTACAACATC ATTTAAACCC 

251 CGTAACCGAG TAAGATTTGG CCATTTTOGC 6GGAAAACTG AATAAGAGGA 
GCATTGGCTC ATTCTAAACC GGTAAAAQCG CCCTTTTCAC TTATTCTCCT 

301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 
TCACTTTAGA CTTATTAAAA CACAATGAGT ATCGCGCATT ATAAACAGAT 

351 GGGCCGCGGG GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT 
CCCGGCGCCC CTGAAACTGG CAAATGCACC TCTGAGCGGG TCCACAAAAA 

401 CTCAGGTGTT TTCCGCGTTC CGGGTCAAAG TTGGCGTTTT ATTATTATAG 
GAGTCCACAA AAGGCGCAA6 GCCCAGTTTC AACCGCAAAA TAATAATATC 

451 CCGGCCGCGA TCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT 
CGCCGGCGCT A6GTAACGTA TGCAACATAG GTATAGTATT ATACATGTAA 

501 TATATTGGCT CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATT6AC 
ATATAACCGA GTACAGGOTG TAATGGCGGT ACAACTGTAA CTAATAACTG 

551 TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTACTTCAT AGCCCATATA 
ATCAATAATT ATCATTAGTT AATGCCCCA6 TAATCAAGTA TCGGGTATAT 

601 TGGAGTTCCG CGTTACATAA CrTACGGTAA ATGGCCCGCC TGGCTGACCG 
ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCG6 ACCGACTGGC 

651 CCCAACGACC CCC6CCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 
GGGTTGCTGG GGGCGGGTAA CTGCA6TTAT TACTGCATAC AAGGGTATCA 

701 AACGCCAATA GGGACTTTCC ATT6ACGTCA ATG06TGGAG TATTTACGGT 
. TTCCGGTTAT CCCTGAAAGG TAACTGCAGT TACCXJlCCTC ATAAATGCCA 

751 AAACTGCCCA CITGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 
TTTGACGGGT CAACCGTCAT GTAGTTCACA TAGTATACGG TTCATGCGGG 

801 CCTATTGACG TCAATGAC6G TAAAT6GCCC GCCTGGCATT ATGCCCAGTA 
GGATAACTGC AGTTACTGCC ATirFACCGGG CGGACCGTAA TACGGGTCAT 

851 CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 
GTACTGGAAT ACCCTGAAAG GATGAACCGT CATGTAGAT6 CATAATCAGT 
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901 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA 
AGCGATAATG GTACCACTAC GCCAAAACCG TCATGTAGTT ACCCGCACCT 

951 TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 
ATCGCCAAAC TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT 

1001 TGGGAGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA . 
ACCCTCAAAC AAAACCGTGG TTTTAGTTGC CCTGAAAGGT TTTACAGCAT 

1051 ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG 
TGTTGAGGCG GGGTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC 

1101 GTCTATATAA GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG 
CAGATATATT CGTCTCGAGC AAATCACTTG GCAGTCTAGC GGACCTCTGC 

1151 CCATCCACGC TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC 
GGTAGGTGCG ACAAAACTGG AGGTATCTTC TGTGGCCCTG GCTAGGTCGG 

1201 TCCGCGGCCG GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT 
AGGCGCCGGC CCTT6CCACG TAACCTTGCG CCTAAGGGGC ACGGTTCTCA 

1251 GAGATCTACC ATGGCCCCCA TCTCCCCCAT TGAGACTGTG CCTGTGAAGC 
CTCTAGATGG TACCGGGGGT AGAGGGGGTA ACTCTGACAC GGACACTTCG 

1301 TGAAGCCTG6 CATGGATGGC CCCAAGGTGA AGCAGTGGCC CCTCACTGAG 
ACTTCGGACC GTACCTACCG GGGTTCCACT TCGTCACCGG GGACTGACTC • 

1351 GAGAAGATCA AGGCCCTGGT GGAAATCTGC ACTGAGATGG AGAAGGAGGG 
CTCTTCTAGT TCCGGGACGA CCTTTAGACG TGACTCTACC TCTTCCTCCC 

1401 CAAAATCTCC AAGATTGGCC CCGAGAACCC CTACAACACC CCTGTGTTTG 
GTTTTAGAGG TTCTAACCGG GGCTCTTGGG GATGTTGTGG GGACACAAAC 

1451 CCATCAAGAA GAAGGACTCC ACCAAGTGGA GGAAGCTGGT GGACTTCAGG 
GGTAGTTCTT CTTCCTGAGG TGGTTCACCT CCTTCGACCA CCTGAAGTCC 

1501 GAGCTGAACA AGAGGACCCA GGACTTCTGG GAGGTGCAGC TGGGCATCCC 
CTCGACTTGT TCTCCTGGGT CCTGAAGACC CTCCACGTCG ACCCGTAGGG 

1551 CCACCCCGCT GGCCTGAAGA AGAAGAAGTC TGTGACTGTG CTGGCTGTGG 
GGTGGGGCGA CCGGACTTCT TCTTCTTCAG ACACTGACAC GACCGACACC 

1601 GGGATGCCTA CTTCTCTGTG CCCCTGGATG AGGACTTCAG GAAGTACACT 
CCCTAC6GAT GAAGAGACAC GGGGACCTAC TCCTGAAGTC CTTCAT6TGA 

1651 GCCTTCACCA TCCCCTCCAT CAACAATGAG ACCCCTGGCA TCAGGTACCA 
CGGAAGTGGT AGGGGAGGTA GTTGTTACTC TGGGGACCGT AGTCCATGGT 

1701 GTACAATGTG CTGCCCCAGG GCTGGAAGGG CTCCCCTGCC ATCTTCCAGT 
CATGTTACAC GACGGGGTCC CGACCTTCCC GAGGGGACGG TAGAAGGTCA 

1751 CCTCCATGAC CAAGATCCTG GAGCCCTTCA GGAAGCAGAA CCCT6ACATT 
GGAGGTACTG GTTCTAGGAC CTCGGGAAGT CCTTCGTCTT GGGACTGTAA 

1801 GTGATCTACC AGTACATGGC TGCCCTGTAT GTGGGCTCTG ACCTGGAGAT 
CACTAGATGG TCATGTACCG ACGGGACATA CACCCGAGAC TGGACCTCTA 
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1901 GGGGCCTGAC CACCCCTGAC AAGAAGCACC AGAAGGAGCC CCCCTTCCTG 
CCCCGGACTG GTGGGGACTG TTCTTCGTGG TCTTCCTCGG GGGGAAGGAC 

1951 TGGATGGGCT ATGAGCTGCA CCCCGACAAG TGGACTGTGC AGCCCATTGT 
ACCTACCCGA TACTCGACGT GGGGCTGTTC ACCTGACACG TCGGGTAACA 

2001 GCTGCCTGAG AAGGACTCCT GGACTGTGAA TGACATCCAG AAGCTGGTGG 
CGACGGACTC TTCCTGAGGA CCTGACACTT ACTGTAGGTC TTCGACCACC 

2051 GCAAGCTGAA CTGGGCCTCC CAAATCTACC CTGGCATCAA GGTGAGGCAG 
CGTTCGACTT GACCCGGAGG GTTTAGATGG GACCGTA6TT CCACTCCGTC 

2101 CTGTGCAAGC TGCTGAGGGG CACCAAGGCC CTGACTGAGG TGATCCCCCT 
GACACGTTCG AGGACTCCCC GTGGTTCCGG GACTGACTCC ACTAGGGGGA 

2151 GACTGAGGAG GCTGAGCTGG AGCTGGCTGA GAACAGGGAG ATCCTGAAGG 
CTGACTCCTC CGACTCGACC TCGACCGACT CTTGTCCCTC TAGGACTTCC 

2201 AGCCT6TGCA TGGGGTGTAC TATGACCCCT CCAAGGACCT GATTGCTGAG 
TCGGACACGT ACCCCACATG ATACTGGGGA GGTTCCTGGA CTAACGACTC 

2251 ATCCAGAAGC AGGGCCAGGG CCAGTGGACC TACCAAATCT ACCAGGAGCC 
TAGGTCTTCG TCCCGGTCCC GGTCACCTGG ATGGTTTAGA TGGTCCTCGG 

2301 CTTCAAGAAC CTGAAGACTG GCAAGTATGC CAGGATGAGG GGGGCCCACA 
GAAGTTCTTG GACTTCTGAC CGTTCATACG GTCCTACTCC CCCCGGGTGT 

2351 CCAATGATGT GAAGCAGCTG ACTGAGGCTG TGCAGAAGAT CACCACTGAG 
GGTTACTACA CTTCGTCGAC TGACTCCGAC ACGTCTTCTA GTGGTGACTC 

2401 TCCATTGTGA TCTGGGGCAA GACCCCCAAG TTCAAGCTGC CCATCCAGAA 
AGGTAACACT AGACCCCGTT CTGGGGGTTC AAGTTCGACG GGTAGGTCTT 

2451 GGAGACCTGG GAGACCTGGT GGACTGAGTA CTGGCAGGCC ACCTGGATCC 
CCTCTGGACC CTCTGGACCA CCTGACTCAT GACCGTCCGG TGGACCTAGG 

2501 CTGAGTGGGA GTTTGTGAAC ACCCCCCCCC TGGTGAAGCT GTGGTACCAG 
GACTCACCCT CAAACACTTG TGGGGGGGGG ACCACTTCGA CACCATGGTC 

2551 CTGGAGAAGG AGCCCATTGT GGGGGCTGAG ACCTTCTATG TGGCTGGGGC 
GACCTCTTCC TCGGGTAACA CCCCCGACTC TGGAAGATAC ACCGACCCCG 

2601 TGCCAACAGG GAGACCAAGC TGGGCAAGGC TGGCTATGTG ACCAACAGGG 
ACGGTTGTCC CTCTGGTTCG ACCCGTTCCG ACCGATACAC TGGTTGTCCC 

2651 GCAGGCAGAA 66TGGTGACC CTGACTGACA CCACCAACCA GAAGACTGCC 
CGTCCGTCTT CCACCACTGG GACTGACTGT GGTGGTTGGT CTTCTGACGG 

2701 CTCCAGGCCA TCTACCTGGC CCTCCAGGAC TCTGGCCTGG AGGTGAACAT 
GAGGTCCGGT AGATGGACCG GGAGGTCCTG AGACCGGACC TCCACTTGTA 

2751 TGTGACTGCC TCCCAGTATG CCCTGGGCAT CATCCAGGCC CAGCCTGATC 
ACACTGACGG AGGGTCATAC GGGACCCGTA isTAGGTCCGG GTCGGACTAG 
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GAGAAGGTGT ACCTGGCCTG GGTGCCTGCC CACAAGGGCA TTGGGGGCAA 
CTCTTCCACA T6GACCGGAC CCACGGACGG GTGTTCCCGT AACCCCCGTT 

TGAGCAGGTG GACAAGCTGG TGTCTGCTGG CATCAGGAAG GTGCTGTTCC 
ACTCGTCCAC CTGTTCGACC ACAGACGACC GTAGTCCTTC CACGACAAGG 

TGGATGGCAT TGACAAGGCC CAGGATGAGC ATGAGAAGTA CCACTCCAAC 
ACCTACCGTA ACTGTTCCGG GTCCTACTCG TACTCTTCAT GGTGAGGTTG 

TGGAGGGCTA TGGCCTCTGA CTTCAACCTG CCCCCTGTGG TGGCTAAGGA 
ACCTCCCGAT ACCGGAGACT GAAGTTGGAC GGGGGACACC ACCGATTCCT 

GATTGTGGCC TCCTGTGACA AGTGCCAGCT GAAGGGGGAG GCCATGCATG 
CTAACACCGG AGGACACTGT TCAC6GTCGA CTTCCCCCTC CGGTACGTAC 

GGCAGGTGGA CTGCTCCCCT GGCATCTGGC AGCTGGCCTG CACCCACCTG 
CCGTCCACCT GACGAGGGGA CCGTAGACCG TCGACCGGAC GTGGGTGGAC 

GAGGGCAAGG TGATCCTGGT GGCTGTGCAT GTGGCCTCCG GCTACATTGA 
CTCCCGTTCC ACTAGGACCA CCGACACGTA CACCGGAGGC CGATGTAACT 

GGCTGAGGTG ATCCCTGCTG AGACAGGCCA GGAGACTGCC TACTTCCTGC 
CCGACTCCAC TAGGGACGAC TCTGTCCGGT CCTCTGACGG ATGAAGGACG 

TGAAGCTGGC TGGCAGGTGG CCTGTGAAGA CCATCCACAC TGCCAATGGC 
ACTTCGACCG ACCGTCCACC GGACACTTCT GGTAGGT6TG ACGGTTACCG 

TCCAACTTCA CTGGGGCCAC AGTGAGGGCT GCCTGCTGGT GGGCTGGCAT 
AGGTTGAAGT GACCCCGGTG TCACTCCCGA CGGACGACCA CCCGACCGTA 

CAAGCAGGAG TTTGGCATCC CCTACAACCC CCAGTCCCAG GGGGTGGTGG 
GTTCGTCCTC AAACCGTAGG GGATGTTGGG GGTCAGGGTC CCCCACCACC 

CCTCCATGAA CAAGGAGCTG AAGAAGATCA TTGGGCAGGT GAGGGACCAG 
GGAGGTACTT GTTCCTCGAC TTCTTCTAGT AACCCGTCCA CTCCCTGGTC 

GCTGAGCACC TGAAGACAGC TGTGCAGATG GCTGTGTTCA TCCACAACTT 
CGACTCGTGG ACTTCTGTCG ACACGTCTAC CGACACAAGT AGGTGTTGAA 

CAAGAGGAAG GGGGGCATCG GGGGCTACTC CGCTGGGGAG AGGATTGTGG 
GTTCTCCTTC CCCCCGTAGC CCCCGATGAG GC6ACCCCTC TCCTAACACC 

ACATCATTGC CACAGACATC CAGACCAAGG AGCTCCA6AA GCAGATCACC 
TGTAGTAACG GTGTCTGTAG GTCTG6TTCC TCGAGGTCTT CGTCTAGTGG 

AAGATCCAGA ACTTCAGGGT 6TACTACAGG GACTCCAGGA ACCCCCTGTG 
TTCTAGGTCT TGAAGTCCCA CATGATGTCC CTGAGGTCCT TGGGGGACAC 

GAAGGGCCCT GCCAAGCTGC T6TGGAAGGG GGAGGGGGCT GTGGTGATCC 
CTTCCCGGGA CGGTTCGACG ACACCTTCCC CCTCCCCCGA CACCACTA6G 

AGGACAACTC TGACATCAAG GTGGTGCCCA GGAGGAAGGC CAAGATCATC 
TCCTGTTGAG ACTGTAGTTC CACCACGGGT CCTCCTTCCG GTTCTAGTAG 
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3801 GGATGAGGAC TAAAGCCCGG GCAGATCTGC TGTGCCTTCT AGTTGCCAGC 
CCTACTCCTG ATTTCGGGCC CGTCTAGACG ACACGGAAGA TCAACGGTCG 

3851 CATCTGTTGT TTGCCCCTCC CCCGTGCCTT CCTTGACCCT GGAAGGTGCC 
GTAGACAACA AACGGGGAGG GGGCACGGAA GGAACTGGGA CCTTCCACGG 

3901. ACTCCCACTG TCCTTTCCTA ATAAAATGAG GAAATTGCAT CGCATTGTCT 
TGAGGGTGAC AGGAAAGGAT TATTTTACTC CTTTAACGTA GCGTAACAGA 

3951 GAGTAGGTGT CATTCTATTC TGGGGGGTGG GGTGGGGCAG GACAGCAAGG 
CTCATCCACA GTAAGATAAG ACCCCCCACC CCACCCCGTC CTGTCGTTCC 

4001 GGGAGGATTG GGAAGACAAT AGCAGGCATG CTGGGGATGC GGTGGGCTCT 
CCCTCCTAAC CCTTCTGTTA TCGTCCGTAC GACCCCTACG CCACCCGAGA 

4051 ATGGCCGATC GGCGCGCCGT ACTGAAATGT GTGGGCGTGG CTTAAGGGTG 
TACCGGCTAG CCGCGCGGCA TGACTTTACA CACCCGCACC GAATTCCCAC 

4101 GGAAAGAATA TATAAGGTGG GGGTCTTATG TAGTTTTGTA TCTGTTTTGC 
CCTTTCTTAT ATATTCCACC CCCAGAATAC ATCAAAACAT AGACAAAACG 

4151 AGCAGCCGCC GCCGCCATGA GCACCAACTC GTTTGATGGA AGCATTGTGA 
TCGTCGGCGG CGGCGGTACT CGTGGTTGAG CAAACTACCT TCGTAACACT 

4201 GCTCATATTT GACAACGCGC ATGCCCCCAT GGGCCGGGGT GCGTCAGAAT 
CGAGTATAAA CTGTTGCGCG TACGGGGGTA CCCGGCCCCA CGCAGTCTTA 

4251 GTGATGGGCT CCAGCATTGA TGGTCGCCCC GTCCTGCCCG CAAACTCTAC 
CACTACCCGA GGTCGTAACT ACCAGCGGGG CAGGACGGGC GTTTGAGATG 

4301 TACCTTGACC TACGAGACCG TGTCTGGAAC GCCGTTGGAG ACTGCAGCCT 
ATGGAACTGG ATGCTCTGGC ACAGACCTTG CGGCAACCTC TGACGTCGGA 

4351 CCGCCGCCGC TTCAGCCGCT GCAGCCACCG CCCGCGGGAT TGTGACTGAC 
GGCGGCGGCG AAGTCGGCGA CGTCGGTGGC GGGCGCCCTA ACACTGACTG 

4401 TTTGCTTTCC TGAGCCCGCT TGCAAACAGT GCAGCTTCCC GTTCATCCGC 
AAACGAAAGG ACTCGGGCGA ACGTTTGTCA CGTCGAAGGG CAAGTAGGCG 

4451 CCGCGATGAC AAGTTGACGG CTCTTTTGGC ACAATTGGAT TCTTTGACCC 
GGCGCTACTG TTCAACTGCC GAGAAAACCG TGTTAACCTA AGAAACTGGG 

4501 GGGAACTTAA TGTCGTTTCT CAGCAGCTGT TGGATCTGCG CCAGCAGGTT 
CCCTTGAATT ACAGCAAAGA GTCGTCGACA ACCTAGACGC GGTCGTCCAA 

4551 TCTGCCCTGA AGGCTTCCTC CCCTCCCAAT GCGGTTTAAA ACATAAATAA 
AGACGGGACT TCCGAAGGAG GGGA6GGTTA CGCCAAATTT T6TATTTATT 

4601 AAAACCAGAC TCTGTTTGGA TTTGGATCAA GCAAGTGTCT TGCTGTCTTT 
TTTTGGTCTG AGACAAACCT AAACCTAGTT CGTTCACAGA ACGACAGAAA 

4651 ATTTAGGGGT TTTGCGCGCG CGGTAGGCCC GGGACCAGCG GTCTCGGTCG 
TAAATCCCCA AAACGCGCGC GCCATCCGGG CCCTGGTCGC CAGAGCCAGC 
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GTTCAGATAC ATGGGCATAA GCCCGTCTCT GGGGTGGAGG TAGCACCACT 
CAAGTCTATG . TACCCGTATT CGGGCAGAGA CCCCACCTCC ATCGTGGTGA 

GCAGAGCTTC ATGCTGCGGG GTGGTGTTGT AGATGATCCA GTCGTAGCAG 
CGTCTCGAAG TACGACGCCC CACCACAACA TCTACTAGGT CAGCATCGTC 

GAGCGCTGGG CGTGGTGCCT AAAAATGTCT TTCAGTAGCA AGCTGATTGC 
CTCGCGACCC GCACCACGGA TTTTTACAGA AAGTCATCGT TCGACTAACG 

CAGGGGCAGG CCCTTGGTGT AAGTGTTTAC AAAGCGGTTA AGCTGGGATG 
GTCCCCGTCC GGGAACCACA TTCACAAATG TTTC6CCAAT TCGACCCTAC 

GGTGCATACG TGGGGATATG AGATGCATCT TGGACTGTAT TTTTAGGTTG 
CCACGTATGC ACCCCTATAC TCTACGTAGA ACCTGACATA AAAATCCAAC 

GCTATGTTCC CAGCCATATC CCTCCGGGGA TTCATGTTGT GCAGAACCAC 
CGATACAAGG GTCGGTATAG GGAGGCCCCT AAGTACAACA CGTCTTGGTG 

CAGCACAGTG TATCCGGTGC ACTTGGGAAA TTTGTCATGT AGCTTAGAAG 
GTCGTGTCAC ATAGGCCACG TGAACCCTTT AAACAGTACA TCGAATCTTC 

GAAATGCGTG GAAGAACTTG GAGACGCCCT TGTGACCTCC AAGATTTTCC 
CTTTACGCAC CTTCTTGAAC CTCTGCGGGA ACACTGGAGG TTCTAAAAGG 

ATGCATTCGT CCATAATGAT GGCAATGGGC CCACGGGCGG CGGCCTGGGC 
TACGTAAGCA GGTATTACTA CCGTTACCCG GGTGCCCGCC GCCGGACCCG 

GAAGATATTT CTGGGATCAC TAACGTCATA GTTGTGTTCC AGGATGAGAT 
CTTCTATAAA GACCCTAGTG ATTGCAGTAT CAACACAAGG TCCTACTCTA 

CGTCATAGGC CATTTTTACA AAGCGCGGGC GGAGGGTGCC AGACTGCGGT 
GCAGTATCCG GTAAAAATGT TTCGCGCCCG CCTCCCACGG TCTGACGCCA 

ATAATGGTTC CATCCGGCCC AGGGGCGTAG TTACCCTCAC AGATTTGCAT 
TATTACCAAG GTAGGCCGGG TCCCCGCATC AATGGGAGTG TCTAAACGTA 

TTCCCACGCT TTGAGTTCAG ATGGGGGGAT CATGTCTACC TGCGGGGCGA 
AAGGGTGCGA AACTCAAGTC TACCCCCCTA GTACA6ATGG ACGCCCCGCT 

TGAAGAAAAC GGTTTCCGGG GTAGGGGAGA TCAGCTGGGA AGAAAGCAGG 
ACTTCTTTTG CCAAAGGCCC CATCCCCTCT AGTCGACCCT TCTTTCGTCC 

TTCCTGAGCA GCTGCGACTT ACCGCAGCCG GTGGGCCCGT AAATCACACC 
AAGGACTCGT CGACGCTGAA TGGCGTCGGC CACCCGGGCA TTTAGTGTGG 

TATTACCGGC TGCAACTGGT AGTTAAGAGA GCTGCAGCTG CCGTCATCCC 
ATAATGGCCG ACGTTGACCA TCAATTCTCT CGACGTCGAC GGCAGTAGGG 

TGAGCAGGGG GGCCACTTCG TTAAGCATGT CCCTGACTCG CATGTTTTCC 
ACTCGTCCCC CCGGTGAAGC AATTC6TACA GGGACTGAGC GTACAAAA6G 

CTGACCAAAT CCGCCAGiAAG GCGCTCGCCG CCCAGCGATA GCAGTTCTTG 
GACTGGTTTA GGCGGTCTTC CGCGAGCGGC GGGTCGCTAT CGTCAAGAAC 
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5701 TTTTGAGCGT TTGACCAAGC AGTTCCAGGC GGTCCCACAG CTCGGTCACC 
AAAACTCGCA AACTGGTTCG TCAAGGTCCG CCAGGGTGTC GAGCCAGTGG 

5751 TGCTCTACGG CATCTCGATC CAGCATATCT CCTCGTTTCG CGGGTTGGGG 
ACGAGATGCC GTAGAGCTAG GTCGTATAGA GGAGCAAAGC GCCCAACCCC 

5801 CGGCTTTCGC TGTACGGCAG TAGTCGGTGC TCGTCCAGAC GGGCCAGGGT 
GCCGAAAGCG ACATGCCGTC ATCAGCCACG AGCAGGTCTG CCCGGTCCCA 

5851 CATGTCTTTC CACGGGCGCA GGGTCCTCGT CAGCGTAGTC TGGGTCACGG 
GTACAGAAAG GTGCCCGCGT CCCAGGAGCA GTCGCATCAG ACCCAGTGCC 

5901 TGAAGGGGTG CGCTCCGGGC TGCGCGCTGG CCAGGGTGCG CTTGAGGCTG 
ACTTCCCCAC GCGAGGCCCG ACGCGCGACC GGTCCCACGC GAACTCCGAC 

5951 GTCCTGCTGG TGCTGAAGCG CTGCCGGTCT TCGCCCTGCG CGTCGGCCAG 
CAGGACGACC ACGACTTCGC GACGGCCAGA AGCGGGACGC GCAGCC66TC 

6001 GTAGCATTTG ACCATGGTGT CATAGTCCAG CCCCTCCGCG GCGTGGCCCT 
CATCGTAAAC TGGTACCACA GTATCAGGTC GGGGAGGCGC CGCACCGGGA 

6 or-: TGGCGCGCAG CTTGCCCTTG GAGGAGGCGC CGCACGAGGG GCAGTGCAGA 
ACCGCGCGTC GAACGGGAAC CTCCTCCGCG GCGTGCTCCC CGTCACGTCT 

6101 CTTTTGAGGG CGTAGAGCTT GGGCGCGAGA AATACCGATT CCGGGGAGTA 
GAAAACTCCC GCATCTCGAA CCCGCGCTCT TTATGGCTAA GGCCCCTCAT 

6151 GGCATCCGCG CCGCAGGCCC CGCAGACGGT CTCGCATTCC ACGAGCCAGG 
CCGTAGGCGC GGCGTCCGGG GCGTCTGCCA GAGCGTAAGG TGCTCGGTCC 

6201 TGAGCTCTGG CCGTTCGGGG TCAAAAACCA GGTTTCCCCC ATGCTTTTTG 
ACTCGAGACC GGCAAGCCCC AGTTTTTGGT CCAAAGGGGG TACGAAAAAC 

6251 ATGCGTTTCT TACCTCTGGT TTCCATGAGC CGGTGTCCAC GCTCGGTGAC 
TACGCAAAGA ATGGAGACCA AAGGTACTCG GCCACAGGTG CGAGCCACTG 

6301 GAAAAGGCTG TCCGTGTCCC CGTATACAGA CTTGAGAGGC CTGTCCTCGA 
CTTTTCCGAC AGGCACAGGG GCATATGTCT GAACTCTCCG GACAGGAGCT 

6351 GCGGTGTTCC GCGGTCCTCC TCGTATAGAA ACTCGGACCA CTCTGAGACA 
CGCCACAAGG CGCCAGGAGG AGCATATCTT TGAGCCTGGT GAGACTCTGT 

6401 AAGGCTCGCG TCCAGGCCAG CACGAAGGAG GCTAAGTGGG AGGGGTAGCG 
TTCCGAGCGC AGGTCCGGTC 6TGCTTCCTC CGATTCACCC TCCCCATCGC 

6451 GTCGTTGTCC ACTAGGGGGT CCACTCGCTC CAGGGTGTGA AGACACATGT 
CAGCAACAGG TGATCCCCCA GGTGAGCGAG GTCCCACACT TCTGTGTACA 

6501 CGCCCTCTTC GGCATCAAGG AAGGTGATTG GTTTGTAGGT GTAGGCCACG 
GCGGGAGAAG CCGTAGTTCC TTCCACTAAC CAAACATCCA CATCCGGTGC 

6551 TGACCGGGTG TTCCTGAAGG GGGGCTATAA AAGGGGGTGG GGGCGCGTTC 
ACTGGCCCAC AAGGACTTCC CCCCGATATT TTCCCCCACC CCCGCGCAAG 
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AGTACTCCCT CT6AAAAGCG GGCATGACTT CTGCGCTAAG ATTGTCAGTT 
TCATGAGGGA GACTTTTCGC CCGTACTGAA GACGCGATTC TAACAGTCAA 

TCCAAAAACG AGGAGGATTT GATATTCACC TGGCeCGCGG TGATGCCTTT 
AGGTTTTTGC TCCTCCTAAA CTATAAGTGG ACCGGGCGCC ACTACGGAAA 

GAGGGTGGCC GCATCCATCT GGTCAGAAAA GACAATCTTT TTGTTGTCAA 
CTCCCACCGG CGTAGGTAGA CCAGTCTTTT CTGTTAGAAA AACAACAGTT 

GCTTGGTGGC AAACGACCCG TAGAGGGCGT TGGACAGCAA CTTGGCGATG 
CGAACCACCG TTTGCTGGGC ATCTCCCGCA ACCTGTCGTT GAACCGCTAC 

GAGCGCAGGG TTTGGTTTTT GTCGCGATCG GCGCGCTCCT TGGCCGCGAT 
CTCGCGTCCC AAACCAAAAA CAGCGCTAGC CGCGCGAGGA ACCGGCGCTA 

GTTTAGCTGC ACGTATTCGC GCGCAACGCA CCGCCATTCG GGAAAGACGG 
CAAATCGACG TGCATAAGCG CGCGTTGCGT GGCGGTAAGC CCTTTCTGCC 

TGGTGCGCTC GTCGGGCACC AGGTGCACGC GCCAACCGCG GTTGTGCAGG 
ACCACGCGAG CAGCCCGTGG TCCACGT6CG CGGTTGGCGC CAACACGTCC 

GTGACAAGGT CAACGCTGGT GGCTACCTCT CCGCGTAGGC GCTCGTTGGT 
CACTGTTCCA GTTGCGACCA CCGATGGAGA GGCGCATCCG CGAGCAACCA 

CCAGCAGAGG CGGCCGCCCT TGCGCGAGCA GAATGGCGGT AGGGGGTCTA 
GGTCGTCTCC GCCGGCGGGA ACGCGCTCGT CTTACCGCCA TCCCCCAGAT 

GCTGCGTCTC GTCCGQGGGG TCTGCGTCCA CGGTAAAGAC CCCGGGCAGC 
CGACGCAGAG CAGGCCCCCC AGACGCAGGT GCCATTTCTG GGGCCCGTCG 

AGGCGCGCGT CGAAGTAGTC TATCTTGCAT CCTTGCAAGT CTAGCGCCTG 
TCCGCGCGCA GCTTCATCAG ATAGAACGTA GGAACGTTCA GATCGCGGAC 

CTGCCATGCG CGGGCGGCAA GCGCGCGCTC GTATGGGTTG AGTGGGGGAC 
GACGGTACGC GCCCGCCGTT CGCGCGCGAG CATACCCAAC TCACCCCCTG 

CCCATGGCAT GGGGTGGGTG AGCGCGGAGG CGTACATGCC GCAAATGTCG 
GGGTACCGTA CCCCACCCAC TCGCGCCTCC GCATGTACGG CGTTTACAGC 

TAAACGTAGA GGGGCTCTCT GAGTATTCCA AGATATGTAG GGTAGCATCT 
ATTTGCATCT CCCCGAGAGA CTCATAAGGT TCTATACATC CCATCGTAGA 

TCCACCGCGG ATGCTGGCGC GCACGTAATC GTATAGTTCG TGCGAGGGAG 
AGGTGGCGCC TACGACCGCG CGTGCATTAG CATATCAAGC ACGCTCCCTC 

CGAGGAGGTC GGGACCGAGG TTGCTACGGG CGGGCTGCTC TGCTCGGAAG 
GCTCCTCCAG CCCTGGCTCC AACGATGCCC . GCCCGACGAG ACGAGCCTTC 

ACTATCTGCC TGAAGATGGC ATGTGAGTTG GATGATATGG TTGGACGCTG 
TGATAGACGG ACTTCTACCG TACACTCAAC CTACTATACC AACCTGCGAC 

GAAGACGTTG AAGCTGGCGT CTGTGAGACC TACCGCGTCA CGCACGAAGG 
CTTCTGCAAC TTCGACCGCA GACACTCTGG ATGGCGCAGT GCGTGCTTCC 
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TCTAGGGCGC AGTAGTCCAG GGTTTCCTTG ATGATGTCAT ACTTATCCTG 
AGATCCCGCG TCATCAGGTC CCAAAGGAAC TACTACAGTA TGAATAGGAC 

TCCCTTTTTT TTCCACAGCT CGCGGTTGAG GACAAACTCT TCGCGGTCTT 
AGGGAAAAAA AAGGTGTCGA GCGCCAACTC CTGTTTGAGA AGCGCCAGAA 

TCCAGTACTC TTGGATCGGA AACCCGTCGG CCTCCGAACG GTAAGAGCCT 
AGGTCATGAG AACCTAGCCT TTGGGCAGCC GGAGGCTTGC CATTCTCGGA 

AGCATGTAGA ACTGGTTGAC GGCCTGGTAG GCGCAGCATC CCTTTTCTAC 
TCGTACATCT TGACCAACTG CCGGACCATC CGCGTCGTAG GGAAAAGATG 

GGGTAGCGCG TATGCCTGCG CGGCCTTCCG GAGCGAGGTG TGGGTGAGCG 
CCCATCGCGC ATACGGACGC GCCGGAAGGC CTCGCTCCAC ACCCACTCGC 

CAAAGGTGTC CCTGACCATG ACTTTGAGGT ACTGGTATTT GAAGTCAGTG 
GTTTCCACAG GGACTGGTAC TGAAACTCCA TGACCATAAA CTTCAGTCAC 

TCGTCGCATC CGCCCTGCTC CCAGAGCAAA AAGTCCGTG.C GCTTTTTGGA 
AGCAGCGTAG GCGGGACGAG GGTCTCGTTT TTCAGGCACG CGAAAAACCT 

ACGCGGATTT GGCAGGGCGA AGGTGACATC GTTGAAGAGT ATCTTTCCCG 
TGCGCCTAAA CCGTCCCGCT TCCACTGTAG CAACTTCTCA TAGAAAGGGC 

CGCGAGGCAT AAAGTTGCGT GTGATGCGGA AGGGTCCCGG CACCTCGGAA 
GCGCTCCGTA TTTCAACGCA CACTACGCCT TCCCAGGGCC GTGGAGCCTT 

CGGTTGTTAA TTACCTGGGC GGCGAGCACG ATCTCGTCAA AGCCGTTGAT 
GCCAACAATT AATGGACCCG CCGCTCGTGC TAGAGCAGTT TCGGCAACTA 

GTTGTGGCCC ACAATGTAAA GTTCCAAGAA GCGCGGGATG CCCTTGATGG 
CAACACCGGG TGTTACATTT CAAGGTTCTT CGCGCCCTAC GGGAACTACC 

AAGGCAATTT TTTAAGTTCC TCGTAGGTGA GCTCTTCAGG GGAGCTGAGC 
TTCCGTTAAA AAATTCAAGG AGCATCCACT CGAGAAGTCC CCTCGACTCG 

CCGTGCTCTG AAAGGGCCCA GTCTGCAAGA TGAGGGTTGG AAGCGACGAA 
GGCACGAGAC TTTCCCGGGT CAGACGTTCT ACTCCCAACC TTCGCTGCTT 

TGAGCTCCAC AGGTCACGGG CCATTAGCAT TTGCAGGTGG TCGCGAAAGG 
ACTCGAGGTG TCCAGTGCCC GGTAATCGTA AACGTCCACC AGCGCTTTCC 

tCCTAAACTG GCGACCTATG GCCATTTTTT CTGGGGTGAT GCAGTAGAAG 
AGGATTTGAC CGCTGGATAC CGGTAAAAAA GACCCCACTA CGTCATCTTC 

GTAAGCGGGT CTTGTTCCCA GCGGTCCCAT CCAAGGTTCG CGGCTAGGTC 
CATTCGCCCA GAACAAGGGT CGCCAGGGTA GGTTCCAAGC GCCGATCCAG 

TCGCGCGGCA GTCACTAGAG GCTCATCTCC GCCGAACTTC ATGACCAGCA 
AGCGCGCCGT CAGTGATCTC C6AGTAGAGG CGGCTTGAAG TACTGGTCGT 

TGAAGGGCAC GAGCTGCTTC CCAAAGGCCC CCATCCAAGT ATAGGTCTCT 
ACTTCCCGTG CTCGACGAAG GGTTTCCGGG GGTAGGTTCA TATCCAGAGA 
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8551 GAAGAACTGG ATCTCCCGCC ACCAATTGGA GGAGTGGCTA TTGATGTGGT 
CTTCTTGACC TA6AGGGCGG TGGTTAACCT CCTCACCGAT AACTACACCA 

8601 GAAAGTAGAA GTCCCTGCGA CGGGCCGAAC ACTCGTGCTG GCTTTTGTAA 
CTTTCATCTT CAGGGACGCT GCCCGGCTTG TGAGCACGAC CGAAAACATT 

8651 AAACGTGCGC AGTACTGGCA GCGGTGCACG GGCTGTACAT CCTGCACGAG 
TTTGCACGCG TCATGACCGT CGCCACGTGC CCGACATGTA GGACGTGCTC 

8701 GTTGACCTGA CGACCGCGCA CAAGGAAGCA GAGTGGGAAT TTGAGCCCCT 
CAACTGGACT GCTGGCGCGT GTTCCTTCGT CTCACCCTTA AACTCGGGGA 

8751 CGCCTGGCGG GTTTGGCTGG TGGTCTTCTA CTTCGGCTGC TTGTCCTTGA 
GCGGACCGCC CAAACCGACC ACCAGAAGAT GAAGCCGACG AACAGGAACT 

8801 CCGTCTGGCT GCTCGAGGGG AGTTACGGTG GATCGGACCA CCACGCCGCG 
GGCAGACCGA CGAGCTCCCC TCAATGCCAC CTAGCCTGGT GGTGCGGCGC 

8851 CGAGCCCAAA GTCCAGATGT CCGCGCGCGG CGGTCGGAGC TTGATGACAA 
GCTCGGGTTT CAGGTCTACA GGCGCGCGCC GCCAGCCTCG AACTACTGTT 

8901 CATCGCGCAG ATGGGAGCTG TCCATGGTCT GGAGCTCCCG CGGCGTCAGG 
GTAGCGCGTC TACCCTCGAC AGGTACCAGA CCTCGAGGGC GCCGCAGTCC 

8951 TCAGGCGGGA GCTCCTGCAG GTTTACCTCG CATAGACGGG TCAGGGCGCG 
AGTCCGCCCT CGAGGACGTC CAAATGGAGC GTATCTGCCC AGTCCCGCGC 

9001 GGCTAGATCC AGGTGATACC TAATTTCCAG GGGCTGGTTG GTGGCGGCGT 
CCGATCTAGG TCCACTATGG ATTAAAGGTC CCCGACCAAC CACCGCCGCA 

9051 CGATGGCTTG CAAGAGGCCG CATCCCCGCG GCGCGACTAC GGTACCGCGC 
GCTACCGAAC GTTCTCCGGC GTAGGGGCGC CGCGCTGATG CCATGGCGCG 

9101 GGCGGGCGGT GGGCCGCGGG GGTGTCCTTG GATGATGCAT CTAAAAGCGG 
CCGCCCGCCA CCCGGCGCCC CCACAGGAAC CTACTACGTA GATTTTCGCC 

9151 TGACGCGGGC GAGCCCCCGG AGGTAGGGGG GGCTCCGGAC CCGCCGGGAG 
ACTGCGCCCG CTCGGGGGCC TCCATCCCCC CCGAGGCCTG GGCGGCCCTC 

9201 AGGGGGCAGG GGCACGTCGG CGCCGCGCGC GGGCAGGAGC TGGTGCTGCG 
TCCCCCGTCC CCGT6CA6CC GCGGCGCGCG CCCGTCCTCG ACCACGACGC 

9251 CGCGTAGGTT GCTGGCGAAC GCGACGACGC GGCGGTTGAT CTCCTGAATC 
GCGCATCCAA CGACCGCTTG CGCTGCTGCG CCGCCAACTA GAGGACTTAG 

9301 TGGCGCCTCT GCGTGAAGAC GACGGGCCCG 6TGAGCTTGA ACCTGAAAGA 
ACCGCGGAGA CGCACTTCTG CTGCCCGGGC CACTCGAACT TGGACTTTCT 

9351 GAGTTCGACA GAATCAATTT CGGTGTCGTT GACGGCGGCC TGGCGCAAAA 
CTCAAGCTGT CTTAGTTAAA GCCACAGCAA CTGCCGCCGG ACCGCGTTTT 

9401 TCTCCTGCAC GTCTCCTGAG TTGTCTT6AT AGGCGATCTC GGCCATGAAC 
AGAGGACGTG CAGAGGACTC AACAGAACTA TCCGCTAGAG CCGGTACTTG 
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GGCGGCGAGG TCGTTGGAAA TGCGGGCCAT GAGCTGCGAG AAGGCGTTGA 
CCGCCGCTCC AGCAACCTTT ACGCCCGGTA CTCGACGCTC TTCCGCAACT 

GGCCTCCCTC GTTCCAGACG CGGCTGTAGA CCACGCCCCC TTCGGCATCG 
CCGGAGGGAG CAAGGTCTGC GCCGACATCT GGTGCGGGGG AAGCCGTAGC 

CGGGCGCGCA TGACCACCTG CGCGAGATTG AGCTCCACGT GCCGGGCGAA 
GCCCGCGCGT ACTGGTGGAC GCGCTCTAAC TCGAGGTGCA CGGCCCGCTT 

GACGGCGTAG TTTCGCAGGC GCTGAAAGAG GTAGTTGAGG GTGGTGGCGG 
CTGCCGCATC AAAGCGTCCG CGACTTTCTC CATCAACTCC CACCACCGCC 

TGTGTTCTGC CACGAAGAAG TACATAACCC AGCGTCGCAA CGTGGATTCG 
ACACAAGACG GTGCTTCTTC ATGTATTGGG TCGCAGCGTT GCACCTAAGC 

TTGATATCCC CCAAGGCCTC AAGGCGCTCC ATGGCCTCGT AGAAGTCCAC 
AACTATAGGG GGTTCCGGAG TTCCGCGAGG TACCGGAGCA TCTTCAGGTG 

GGCGAAGTTG AAAAACTGGG AGTTGCGCGC CGACACGGTT AACTCCTCCT 
CCGCTTCAAC TTTTTGACCC TCAACGCGCG GCTGTGCCAA TTGAGGAGGA 

CCAGAAGACG GATGAGCTCG GCGACAGTGT CGCGCACCTC GCGCTCAAAG 
GGTCTTCTGC CTACTCGAGC CGCTGTCACA GCGCGTGGAG CGCGAGTTTC 

GCTACAGGGG CCTCTTCTTC TTCTTCAATC TCCTCTTCCA TAAGGGCCTC 
CGATGTCCCC GGAGAAGAAG AAGAAGTTAG AGGAGAAGGT ATTCCCGGAG 

CCCTTCTTCT TCTTCTGGCG GCGGTGGGGG AGGGGGGACA CGGCGGCGAC 
GGGAAGAAGA AGAAGACCGC C6CCACCCCC TCCCCCCTGT GCCGCCGCTG 

GACGGCGCAC CGGGAGGCGG TCGACAAAGC GCTCGATCAT CTCCCCGCGG 
CTGCCGCGTG GCCCTCCGCC AGCTGTTTCG CGAGCTAGTA GAGGGGCGCC 

CGACGGCGCA TGGTCTCGGT GACGGCGCGG CCGTTCTCGC GGGGGCGCAG 
GCTGCCGCGT ACCAGAGCCA CTGCCGCGCC GGCAAGAGCG CCCCCGCGTC 

TTGGAAGACG CCGCCCGTCA TGTCCCGGTT ATGGGTTGGC GGGGGGCTGC 
AACCTTCTGC GGCGGGCAGT ACAGGGCCAA TACCCAACCG CCCCCCGACG 

CATGCGGCAG GGATACGGCG CTAACGATGC ATCTCAACAA TTGTTGTGTA 
GTACGCCGTC CCTATGCCGC GATTGCTACG TAGAGTTGTT AACAACACAT 

GGTACTCCGC CGCCGAGGGA CCTGAGCGAG TCCGCATCGA CCGGATCGGA 
CCATGAGGCG GCGGCTCCCT GGACTCGCTC AGGCGTAGCT GGCCTAGCCT 

AAACCTCTCG AGAAAGGCGT CTAACCAGTC ACAGTCGCAA GGTAGGCTGA 
TTTGGAGAGC TCTTTCCGCA GATTGGTCAG TGTCAGCGTT CCATCCGACT 

GCACCGTGGC GGGCGGCAGC GGGCGGCGGT CGGGGTTGTT TCTGGCGGAG 
CGTGGCACCG CCCGCCGTCG CCCGCCGCCA GCCCCAACAA AGACCGCCTC 

GTGCTGCTGA TGATGTAATT AAAGTAGGCG GTCTTGAGAC GGCGGATGGT 
CACGACGACT ACTACATTAA TTTCATCCGC CAGAACTCTG CCGCCTACCA 
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CGGCCATGCC CCAGGCTTCG TTTTGACATC GGCGCAGGTC TTTGTAGTAG 
GCCGGTACGG GGTCCGAAGC AAAACTGTAG CCGCGTCCAG AAACATCATC 

TCTTGCATGA GCCTTTCTAC CGGCACTTCT TCTTCTCCTT CCTCTTGTCC 
AGAACGTACT CGGAAAGATG GCCGTGAAGA AGAAGAGGAA GGAGAACAGG 

TGCATCTCTT GCATCTATCG CTGCGGCG6C GGCG6AGTTT GGCCGTAGGT 
ACGTAGAGAA CGTAGATAGC 6ACGCCGCCG CCGCCTCAAA CCGGCATCCA 

GGCGCCCTCT TCCTCCCATG CGTGTGACCC CGAAGCCCCT CATCGGCTGA 
CCGCGGGAGA AGGAGGGTAC GCACACTGGG GCTTCGGGGA GTAGCCGACT 

AGCAGGGCTA GGTCGGCGAC AACGCGCTCG GCTAATATGG CCTGCTGCAC 
TCGTCCCGAT CCAGCCGCTG TTGCGCGAGC CGATTATACC GGACGACGTG 

CTGCGTGAGG GTAGACTGGA AGTCATCCAT GTCCACAAAG CGGTGGTATG 
6ACGCACTCC CATCTGACCT TCAGTAGGTA CAGGTGTTTC GCCACCATAC 

CGCCCGTGTT GATGGTGTAA GTGCAGTTGG CCATAACGGA CCAGTTAACG 
GCGGGCACAA CTACCACATT CACGTCAACC GGTATTGCCT GGTCAATTGC 

GTCTGGTGAC CCGGCTGCGA GAGCTCGGTG TACCTGAGAC GCGAGTAAGC 
CAGACCACTG GGCCGACGCT CTCGAGCCAC ATGGACTCTG CGCTCATTCG 

CCTCGAGTCA AATACGTAGT CGTTGCAAGT CCGCACCAGG TACTGGTATC 
GGAGCTCAGT TTATGCATCA GCAACGTTCA GGCGTGGTCC ATGACCATAG 

CCACCAAAAA GTGCGGCGGC GGCTGGCGGT AGAGGGGCCA GCGTAGGGTG 
GGTGGTTTTT CACGCCGCCG CCGACCGCCA TCTCCCCGGT CGCATCCCAC 

GCCGGGGCTC CGGGGGCGAG ATCTTCCAAC ATAAGGCGAT GATATCCGTA 
CGGCCCCGAG GCCCCCGCTC TAGAAGGTTG TATTCCGCTA CTATAGGCAT 

GATGTACCTG GACATCCAGG TGATGCCGGC GGCGGTGGTG GAGGCGCGCG 
CTACATGGAC CTGTAGGTCC ACTACGGCCG CCGCCACCAC CTCCGCGCGC 

GAAAGTCGCG GACGCGGTTC CAGATGTTGC GCAGCGGCAA AAAGTGCTCC 
CTTTCAGCGC CTGCGCCAAG GTCTACAACG CGTCGCCGTT TTTCACGAGG 

ATGGTCGGGA CGCTCTGGCC GGTCAGGCGC GCGCAATCGT TGACGCTCTA 
TACCAGCCCT GCGAGACCGG CCAGTCCGCG CGCGTTAGCA ACTGCGAGAT 

GACCGTGCAA AAGGAGAGCC TGTAAGCGGG CACTCTTCCG TGGTCTGGTG 
CTGGCACGTT TTCCTCTCGG ACATTCGCCC GTGAGAAGGC ACCAGACCAC 

GATAAATTCG CAAGGGTATC ATGGCGGACG ACCGGGGTTC GAGCCCCGTA 
CTATTTAAGC GTTCCCATAG TACCGCCTGC TGGCCCCAAG CTCGGGGCAT 

TCCGGCCGTC CGCCGTGATC CATGCGGTTA CCGCCCGCGT GTCGAACCCA 
AGGCCGGCAG GC0GCACTA6 6TACGCCAAT GGCGGGCGCA CAGCTTGGGT 

GGTGTGCGAC GTCAGACAAC 6GGGGAGTGC TCCTTTTGGC TTCCTTCCAG 
CCACACGCTG CAGTCTGTTG CCCCCTCACG AGGAAAACCG AAGGAAGGTC 
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11401 AAGCGGTTAG GCTGGAAAGC GAAAGCATTA AGTGGCTCGC TCCCTGTAGC 
TTCGCCAATC CGACCTTTCG CTTTCGTAAT TCACCGAGCG AGGGACATCG 

11451 CGGAGGGTTA TTTTCCAAGG GTTGAGTCGC GGGACCCCCG GTTCGAGTCT 
GCCTCCCAAT AAAAGGTTCC CAACTCAGCG CCCTGGGGGC CAAGCTCAGA 

11501 CGGACCGGCC GGACTGCGGC GAACGGGG6T TTGCCTCCCC GTCATGCAAG 
GCCTGGCCGG CCTGACGCCG CTTGCCCCCA AACGGAGGGG CAGTACGTTC 

11551 ACCCCGCTTG CAAATTCCTC CGGAAACAGG GACGAGCCCC TTTTTTGCTT 
TGGGGCGAAC GTTTAAGGAG GCCTTTGTCC CTGCTCGGGG AAAAAACGAA 

11601 TTCCCAGATG CATCCGGTGC TGCGGCAGAT GCGCCCCCCT CCTCAGCAGC 
AAGGGTCTAC GTAGGCCACG ACGCCGTCTA CGCGGGGGGA GGAGTCGTCG 

11651 GGCAAGAGCA AGAGCAGCGG CAGACATGCA GGGCACCCTC CCCTCCTCCT 
CCGTTCTCGT TCTCGTCGCC GTCTGTACGT CCCGTGGGAG GGGAGGAGGA 

11701 ACCGCGTCAG GAGGGGCGAC ATCCGCGGTT GACGCGGCAG CAGATGGTGA 
TGGCGCAGTC CTCCCCGCTG TAGGCGCCAA CTGCGCCGTC GTCTACCACT 

11751 TTACGAACCC CCGCGGCGCC GGGCCCGGCA CTACCTGGAC TTGGAGGAGG 
AATGCTTGGG GGCGCCGCGG CCCGGGCCGT GATGGACCTG AACCTCCTCC 

11801 GCGAGGGCCT GGCGCGGCTA GGAGCGCCCT CTCCTGAGCG GCACCCAAGG 
CGCTCCCGGA CCGCGCCGAT CCTCGCGGGA GAGGACTCGC CGTGGGTTCC 

11851 GTGCAGCTGA AGCGTGATAC gcgtgaggcg tacgtgccgc ggcagaacct 
CACGTCGACT TCGCACTATG CGCACTCCGC ATGCACGGCG CCGTCTTGGA 

11901 GTTTCGCGAC CGCGAGGGAG AGGAGCCCGA GGAGATGCGG GATCGAAAGT 
CAAAGCGCTG GCGCTCCCTC TCCTCGGGCT CCTCTACGCC CTAGCTTTCA 

11951 TCCACGCAGG GCGCGAGCTG CGGCATGGCC TGAATCGCGA GCGGTTGCTG 
AGGTGCGTCC CGCGCTCGAC GCCGTACCGG ACTTAGCGCT CGCCAACGAC 

12001 CGCGAGGAGG ACTTTGAGCC CGACGCGCGA ACCGGGATTA GTCCCGCGCG 
GCGCTCCTCC TGAAACTCGG GCTGCGCGCT TG6CCCTAAT CAGGGCGCGC 

12051 CGCACACGTG GCGGCCGCCG ACCTGGTAAC CGCATACGAG CAGACGGTGA 
GCGTGTGCAC CGCCGGCGGC TGGACCATTG GCGTATGCTC GTCTGCCACT 

12101 ACCAGGAGAT TAACTTTCAA AAAAGCTTTA ACAACCACGT GCGTACGCTT 
TGGTCCTCTA ATTGAAAGTT TTTTCGAAAT TGTTGGTGCA CGCATGCGAA 

12151 GTGGCGCGCG AGGAGGTGGC TATAGGACTG ATGCATCTGT GGGACTTTGT 
CACCGCGCGC TCCTCCACCG ATATCCTGAC TACGTAGACA CCCTGAAACA 

12201 AAGCGCGCTG GAGCAAAACC CAAATAGCAA GCC6CTCATG GCGCAGCTGT 
TTCGCGCGAC CTCGTTTTGG GTTTATC6TT CGGCGAGTAC CGCGTCOACA 

12251 TCCTTATAGT GCAGCACAGC AGGGACAACG AGGCATTCAG GGATGCGCTG 
AGGAATATCA CGTCGTGTCG TCCCTGTTGC TCCGTAAGTC CCTACGCGAC 
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CCTGCAGAGC ATAGTGGTGC AGGAGCGCAG CTTGAGCCTG GCT6ACAA6G 
GGACGTCTCG TATCACCACG TCCTCGCGTC GAACTCGGAC CGACTGTTCC 

TGGCCGCCAT CAACTATTCC ATGCTTAGCC TGGGCAAGTT TTACGCCCGC 
ACCGGCGGTA GTTGATAAGG TACGAATCGG ACCCGTTCAA AATGCGGGCG 

AAGATATACC ATACCCCTTA CGTTCCCATA GACAAGGAGG TAAAGATCGA 
TTCTATATGG TATGGGGAAT GCAAGGGTAT CTGTTCCTCC ATTTCTAGCT 

GG6GTTCTAC ATGCGCATGG CGCTGAAGGT GCTTACCTTG AGCGACGACC 
CCCCAAGATG TACGCGTACC 6CGACTTCCA CGAATGGAAC TCGCTGCTGG 

TGGGCGTTTA TCGCAACGAG CGCATCCACA AGGCCGTGAG CGTGAGCCGG 
ACCCGCAAAT AGCGTTGCTC GCGTAGGTGT TCCGGCACTC GCACTCGGCC 

CGGCGCGAGC TCAGCGACCG CGAGCTGATG CACAGCCTGC AAAGGGCCCT 
GCCGCGCTCG AGTCGCTGGC GCTCGACTAC GTGTCGGACG TTTCCCGGGA 

GGCTGGCACG GGCAGCGGCG ATAGAGAGGC CGAGTCCTAC TTTGACGCGG 
CCGACCGTGC CCGTCGCCGC TATCTCTCCG GCTCAGGATG AAACTGCGCC 

GCGCTGACCT GCGCTGGGCC CCAAGCCGAC GCGCCCTGGA GGCAGCTGGG 
CGCGACTGGA CGCGACCCGG GGTTCGGCTG CGCGGGACCT CCGTCGACCC 

GCCGGACCTG GGCTGGCGGT GGCACCCGCG CGCGCTGGCA ACGTCGGCGG 
CGGCCTGGAC CCGACCGCCA CCGTGGGCGC GCGCGACCGT TGCAGCCGCC 

CGTGGAGGAA TATGACGAGG ACGATGAGTA CGAGCCAGAG GACGGCGAGT 
GCACCTCCTT ATACTGCTCC TGCTACTCAT GCTCGGTCTC CTGCCGCTCA 

ACTAAGCGGT GATGTTTCTG ATCAGATGAT GCAAGACGCA ACGGACCCGG 
TGATTCGCCA CTACAAAGAC TAGTCTACTA CGTTCTGCGT TGCCTGGGCC 

CGGTGCGGGC GGCGCTGCAG AGCCAGCCGT CCGGCCTTAA CTCCACGGAC 
GCCACGCCCG CCGCGACGTC TCGGTCGGCA GGCCGGAATT GAGGTGCCTG 

GACTGGCGCC AGGTCATGGA CCGCATCATG TCGCTGACTG CGCGCAATCC 
CTGACCGCGG TCCAGTACCT GGCGTAGTAC AGCGACTGAC GCGCGTTAGG 

TGACGCGTTC CGGCAGCAGC CGCAGGCCAA CCGGCTCTCC GCAATTCTGG 
ACTGCGCAAG GCCGTCGTCG GCGTCCGGTT GGCCGAGAGG CGTTAAGACC 

AAGCGGTGGT CCCGGCGCGC GCAAACCCCA CGCACGAGAA GGTGCTGGCG 
TTCGCCACCA GGGCCGCGCG CGTTTGGGGT GCGTGCTCTT CCACGACCGC 

ATCGTAAACG CGCTGGCCGA AAACAGGGCC ATCCGGCCCG ACGAGGCCGG 
TAGCATTTGC GCGACCGGCT TTTGTCCCGG TAGGCCGGGC TGCTCCGGCC 

CCTGGTCTAC GACGCGCTGC TTCAGCGCGT GGCTCGTTAC AACAGCGGCA 
GGACCAGATG CTGCGCGACG AAGTCGCGCA CCGAGCAATG TTGTCGCCGT 

ACGTGCAGAC CAACCTGGAC CGGCTGGTGG GGGATGTGCG CGAGGCCGTG 
TGCACGTCTG GTTGGACCTG GCCGACCACC CCCTACACGC GCTCCGGCAC 
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ACTAAACGCC TTCCTGAGTA CACAGCCCGC CAACGTGCCG CGGGGACAGG 
TGATTTGCGG AAGGACTCAT GTGTCGGGCG GTTGCACGGC GCCCCTGTCC 

AGGACTACAC CAACTTTGTG AGCGCACTGC GGCTAATGGT GACTGAGACA 
TCCTGATGTG GTTGAAACAC TCGCGTGACG CCGATTACCA CTGACTCTGT 

CCGCAAAGTG AGGTGTACCA GTCTGGGCCA GACTATTTTT TCCAGACCAG 
GGCGTTTCAC TCCACATGGT CAGACCCGGT CTGATAAAAA AGGTCTGGTC 

TAGACAAGGC CTGCAGACCG TAAACCTGAG CCAGGCTTTC AAAAACTTGC 
ATCTGTTCCG GACGTCTGGC ATTTGGACTC GGTCCGAAAG TTTTTGAACG 

AGGGGCTGTG GGGGGTGCGG GCTCCCACAG GCGACCGCGC GACCGTGTCT 
TCCCCGACAC CCCCCACGCC CGAGGGTGTC CGCTGGCGCG CTGGCACAGA 

AGCTTGCTGA CGCCCAACTC GCGCCTGTTG CTGCTGCTAA TAGCGCCCTT 
TCGAACGACT GCGGGTTGAG CGCGGACAAC GACGACGATT ATCGCGGGAA 

CACGGACAGT GGCAGCGTGT CCCGGGACAC ATACCTAGGT CACTTGCTGA 
GTGCCTGTCA CCGTCGCACA GGGCCCTGTG TATGGATCCA GTGAACGACT 

CACTGTACCG CGAGGCCATA GGTCAGGCGC ATGTGGACGA GCATACTTTC 
GTGACATGGC GCTCCGGTAT CCAGTCCGCG TACACCTGCT CGTATGAAAG 

CAGGAGATTA CAAGTGTCAG CCGCGCGCTG GGGCAGGAGG ACACGGGCAG 
GTCCTCTAAT GTTCACAGTC GGCGCGCGAC CCCGTCCTCC TGTGCCCGTC 

CCTGGAGGCA ACCCTAAACT ACCTGCTGAC CAACCGGCGG CAGAAGATCC 
GGACCTCCGT TGGGATTTGA TGGACGACTG GTTGGCCGCC GTCTTCTAGG 

CCTCGTTGCA CAGTTTAAAC AGCGAGGAGG AGCGCATTTT GCGCTACGTG 
GGAGCAACGT GTCAAATTTG TCGCTCCTCC TCGCGTAAAA CGCGATGCAC 

CAGCAGAGCG TGAGCCTTAA CCTGATGCGC GACGGGGTAA CGCCCAGCGT 
GTCGTCTCGC ACTCGGAATT GGACTACGCG CTGCCCCATT GCGGGTCGCA 

GGCGCTGGAC ATGACCGCGC GCAACATGGA ACCGGGCATG TATGCCTCAA 
CCGCGACCTG TACTGGCGCG CGTTGTACCT TGGCCCGTAC ATACGGAGTT 

ACCGGCCGTT TATCAACCGC CTAATGGACT ACTTGCATCG CGCGGCCGCC 
TGGCCGGCAA ATAGTTGGCG GATTACCTGA TGAACGTAGC GCGCCGGCGG 

GTGAACCCCG AGTATTTCAC CAATGCCATC TTGAACCCGC ACTGGCTACC 
CACTTGGGGC TCATAAAGTG GTTACGGTAG AACTTGGGCG TGACCGATGG 

GCCCCCTGGT TTCTACACCG GGGGATTCGA GGTGCCCGAG GGTAACGATG 
CGGGGGACCA AAGATGTGGC CCCCTAAGCT CCACGGGCTC CCATTGCTAC 

GATTCCTCTG GGACGACATA GACGACAGCG TGTTTTCCCC GCAACCGCAG 
CTAAGGAGAC CCTGCTGTAT CTGCTGTC6C ACAAAAGGGG C6TTGGCGTC 

ACCCTGCTAG AGTTGCAACA GCGCGAGCAG GCAGAGGCGG CGCTGCGAAA 
TGGGACGATC TCAACGTTGT CGCGCTCGTC CGTCTCCGCC GCGACGCTTT 
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CGCGGTCAGA TGCTAGTAGC CCATTTCCAA GCTTGATkGG GTCTCTTACC 
GCGCCAGTCT ACGATCATCG GGTAAAGGTT CGAACTATCC CAGAGAATGG 

AGCACTCGCA CCACCCGCCC GCGCCTGCTG GGCGAGGAGG AGTACCTAAA 
TCGTGAGCGT GGTGGGCGGG CGCGGACGAC CCGCTCCTCC TCATGGATTT 

CAACTCGCTG CTGCAGCCGC AGCGCGAAAA AAACCTGCCT CCGGCATTTC 
GTTGAGCGAC GACGTCGGCG TCGCGCTTTT TTTGGACGGA GGCCGTAAAG 

CCAACAACGG GATAGAGAGC CTAGTGGACA AGATGAGTAG ATGGAAGACG 
GGTTGTTGCC CTATCTCTCG GATCACCTGT TCTACTCATC TACCTTCTGC 

TACGCGCAGG AGCACAGGGA CGTGCCAGGC CCGCGCCCGC CCACCCGTCG 
ATGCGCGTCC TCGTGTCCCT GCACGGTCCG GGCGCGGGCG GGTGGGCAGC 

TCAAAGGCAC GACCGTCAGC GGGGTCTGGT GTGGGAGGAC GATGACTCGG 
AGTTTCCGTG CTGGCAGTCG CCCCAGACCA CACCCTCCTG CTACTGAGCC 

CAGACGACAG CAGCGTCCTG GATTTGGGAG GGAGTGGCAA CCCGTTTGCG 
GTCTGCTGTC GTCGCAGGAC CTAAACCCTC CCTCACCGTT GGGCAAACGC 

CACCTTCGCC CCAGGCTGGG GAGAATGTTT TAAAAAAAAA AAAAGCATGA 
GTGGAAGCGG GGTCCGACCC CTCTTACAAA ATTTTTTTTT TTTTCGTACT 

TGCAAAATAA AAAACTCACC AAGGCCATGG CACCGAGCGT TGGTTTTCTT 
ACGTTTTATT TTTTGAGTGG TTCCGGTACC GTGGCTCGCA ACCAAAAGAA 

GTATTCCCCT TAGTATGCGG CGCGCGGCGA TGTATGAGGA AGGTCCTCCT 
CATAAGGGGA ATCATACGCC GCGCGCCGCT ACATACTCCT TCCAGGAGGA 

CCCTCCTACG AGAGTGTGGT GAGCGCGGCG CCAGTGGCGG CGGCGCTGGG 
GGGAGGATGC TCTCACACCA CTCGCGCCGC GGTCACCGCC GCCGCGACCC 

TTCTCCCTTC GATGCTCCCC TGGACCCGCC GTTTGTGCCT CCGCGGTACC 
AAGAGGGAAG CTACGAGGGG ACCTGGGCGG CAAACACGGA GGCGCCATGG 

TGCGGCCTAC CGGGGGGAGA AACAGCATCC GTTACTCTGA GTTGGCACCC 
ACGCCGGATG GCCCCCCTCT TTGTCGTAGG CAATGAGACT CAACCGTGGG 

CTATTCGACA CCACCCGTGT GTACCTGGTG GACAACAAGT CAACGGATGT 
GATAAGCTGT GGTGGGCACA CATGGACCAC CTGTTGTTCA GTTGCCTACA 

GGCATCCCTG AACTACCAGA ACGACCACAG CAACTTTCTG ACCACGGTCA 
CCGTAGGGAC TTGATGGTCT TGCTGGTGTC GTTGAAAGAC TGGTGCCAGT 

TTCAAAACAA TGACTACAGC CCGGGGGAGG CAAGCACACA GACCATCAAT 
AAGTTTTGTT ACTGATGTCG GGCCCCCTCC GTTCGTGTGT CTGGTAGTTA 

CTTGACGACC GGTCGCACTG GGGCGGCGAC CTGAAAACCA TCCTGCATAC 
GAACTGCTGG CCAGCGTGAC CCCGCCGCTG GACTTTTGGT AGGACGTATG 

CAACATGCCA AATGTGAACG AGTTCATGTT TACCAATAAG TTTAAGGCGC 
GTTGTACGGT TTACACTTGC TCAAGTACAA ATGGTTATTC AAATTCCGCG 
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15151 GGGTGATGGT GTCGCGCTTG CCTACTAAGG ACAATCAGGT GGAGCTGAAA 
CCCACTACCA CAGCGCGAAC GGATGATTCC. TGTTAGTCCA CCTCGACTTT 

15201 TACGAGTGGG TGGAGTTCAC GCTGCCCGAG GGCAACTACT CCGAGACCAT 
ATGCTCACCC ACCTCAAGTG CGACGGGCTC CCGTTGATGA GGCTCTGGTA 

15251 GACCATAGAC CTTATGAACA ACGCGATCGT GGAGCACTAC TTGAAAGTGG 
CTGGTATCTG GAATACTTGT TGCGCTAGCA CCTCGTGATG AACTTTCACC 

15301 GCAGACAGAA CGGGGTTCTG GAAAGCGACA TCGGGGTAAA GTTTGACACC 
CGTCTGTCTT GCCCCAAGAC CTTTCGCTGT AGCCCCATTT CAAACTGTGG 

15351 CGCAACTTCA GACTGGGGTT TGACCCCGTC ACTGGTCTTG TCATGCCTGG 
GCGTTGAAGT CTGACCCCAA ACTGGGGCAG TGACCAGAAC AGTACGGACC 

15401 GGTATATACA AACGAAGCCT TCCATCCAGA CATCATTTTG CTGCCAGGAT 
CCATATATGT TTGCTTCGGA AGGTAGGTCT GTAGTAAAAC GACGGTCCTA 

15451 GCGGGGTGGA CTTCACCCAC AGCCGCCTGA GCAACTTGTT GGGCATCCGC 
CGCCCCACCT GAAGTGGGTG TCGGCGGACT CGTTGAACAA CCCGTAGGCG 

15501 AAGCGGCAAC CCTTCCAGGA GGGCTTTAGG ATCACCTACG ATGATCTGGA 
TTCGCCGTTG GGAAGGTCCT CCCGAAATCC TAGTGGATGC TACTAGACCT 

15551 GGGTGGTAAC ATTCCCGCAC TGTTGGATGT GGACGCCTAC CAGGCGAGCT 
CCCACCATTG TAAGGGCGTG ACAACCTACA CCTGCGGATG GTCCGCTCGA 

15601 TGAAAGATGA CACCGAACAG GGCGGGGGTG GCGCAGGCGG CAGCAACAGC 
ACTTTCTACT GTGGCTTGTC CCGCCCCCAC CGCGTCCGCC GTCGTTGTCG 

15651 AGTGGCAGCG GCGCGGAAGA GAACTCCAAC GCGGCAGCCG CGGCAATGCA 
TCACCGTCGC CGCGCCTT.CT CTTGAGGTTG CGCCGTCGGC GCCGTTACGT 

15701 GCCGGTGGAG GACATGAACG ATCATGCCAT TCGCGGCGAC ACCTTTGCCA 
CGGCCACCTC CTGTACTTGC TAGTACGGTA AGCGCCGCTG TGGAAACGGT 

15751 CACGGGCTGA GGAGAAGCGC GCTGAGGCCG AAGCAGCGGC CGAAGGTGCC 
GTGCCCGACT CCTCTTC6CG CGACTCCGGC TTCGTCGCCG GCTTCGACGG 

15801 GCCCCCGCTG CGCAACCCGA GGTCGAGAAG C.CTCAGAAGA AACCGGTGAT 
CGGGGGCGAC GCGTTGGGCT CCAGCTCTTC GGAGTCTTCT TTGGCCACTA 

15851 CAAACCCCTG ACAGAGGACA GCAAGAAACG CAGTTACAAC CTAATAAGCA 
GTTTGGGGAC TGTCTCCTGT CGTTCTTTGC GTCAATGTTG GATTATTCGT 

15901 ATGACAGCAC CTTCACCCAG TACCGCAGCT GGTACCTTGC ATACAACTAC 
TACTGTCGTG GAAGTGG6TC ATGGCGTCGA CCATGGAACG TATGTTGATG 

15951 GGCGACCCTC AGACCGGAAT CCGCTCATGG ACCCTGCTTT GCACTCCTGA 
CC6CTGGGAG TCTGGCCTTA GGCGAGTACC TGGGACGAAA CGTGAGGACT 

16001 CGTAACCTGC GGCTCGGAGC AGGTCTACTG GTCGTTGCCA GACATGATGC 
GCATTGGACG CCGAGCCTCG TCCAGATGAC CAGCAACGGT CTGTACTACG 

16051 AAGACCCCGT GACCTTCCGC TCCACGCGCC AGATCAGCAA CTTTCCGGTG 
TTCTGGGGCA CTGGAAGGCG AGGTGCGCGG TCTAGTCGTT GAAAGGCCAC 
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GGCCGTCTAC TCCCAACTCA TCCGCCAGTT TACCTCTCTG ACCCACGTGT 
CCGGCAGATG AGGGTTGAGT AGGCGGTCAA ATGGAGAGAC TGGGTGCACA 

TCAATCGCTT TCCCGAGAAC CAGATTTTGG CGCGCCCGCC AGCCCCCACC 
AGTTAGCGAA AGGGCTCTTG GTCTAAAACC GCGCGGGCGG TCGGGGGTGG 

ATCACCACCG TCAGTGAAAA CGTTCCTGCT CTCACAGATC ACGGGACGCT 
TAGTGGTGGC AGTCACTTTT GCAAGGACGA GAGTGTCTAG TGCCCTGCGA 

ACCGCTGCGC AACAGCATCG 6AGGAGTCCA GCGAGTGACC ATTACTGACG 
TGGCGACGCG TTGTCGTAGC CTCCTCAGGT CGCTCACTGG TAATGACTGC 

CCAGACGCCG CACCTGCCCC TACGTTTACA AGGCCCTGGG CATAGTCTCG 
GGTCTGCGGC GTGGACGGGG ATGCAAATGT TCCCGGACCC GTATCAGAGC 

CCGCGCGTCC TATCGAGCCG CACTTTTTGA GCAAGCATGT CCATCCTTAT 
GGCGCGCAGG ATAGCTCGGC GTGAAAAACT CGTTCGTACA GGTAGGAATA 

ATCGCCCAGC AATAACACAG GCTGGGGCCT GCGCTTCCCA AGGAAGATGT 
TAGCGGGTCG TTATTGTGTC CGACCCCGGA CGCGAAGGGT TCGTTCTACA 

TTGGCGGGGC CAAGAAGCGC TCCGACCAAC ACCCAGTGCG CGTGCGCGGG 
AACCGCCCCG GTTCTTCGCG AGGCTGGTTG TGGGTCACGC GCACGCGCCC 

CAGTACCGCG CGCCCTGGGG CGCGCACAAA CGCGGCCGCA CTGGGCGCAC 
GTGATGGCGC GCGGGACCCC GCGCGTGTTT GCGCCGGCGT GACCCGCGTG 

CACCGTCGAT GACGCCATCG ACGCGGTGGT GGAGGAGGCG CGCAACTACA 
GTGGCAGCTA CTGCGGTAGC TGCGCCACCA CCTCCTCCGC GCGTTGATGT 

CGCCCACGCC GCCACCAGTG TCCACAGTGG ACGCGGCCAT TCAGACCGTG 
GCGGGTGCGG CGGTGGTCAC AGGTGTCACC TGCGCCGGTA AGTCTGGCAC 

GTGCGCGGAG CCCGGCGCTA TGCTAAAATG AAGAGACGGC GGAGGCGCGT 
CACGCGCCTC GGGCC6CGAT ACGATTTTAC TTCTCTGCCG CCTCCGCGCA 

AGCACGTCGC CACCGCCGCC GACCCGGCAC TGCCGCCCAA CGCGCGGCGG 
TCGTGCAGCG GTGGCGGCGG CTGGGCCGTG ACGGCGGGTT GCGCGCCGCC 

CGGCCCTGCT TAACCGCGCA CGTCGCACCG GCCGACGGGC GGCCATGCGG 
GCCGGGACGA ATTGGCGCGT GCAGCGTGGC CGGCTGCCCG CCGGTACGCC 

GCCGCTCGAA GGCTGGCCGC GGGTATTGTC ACTGTGCCCC CCAGGTCCAG 
CGGCGAGCTT CCGACCGGC6 CCCATAACAG TGACACGGGG GGTCCAGGTC 

GCGACGAGCG GCCGCCGCAG CAGCCGCGGC CATTAGTGCT ATGACTCAGG 
CGCTGCTCGC CGGCGGCGTC GTCGGCGCCG GTAATCACGA TACTGAGTCC 

GTCGCAGGGG CAACGTGTAT TGGGTGCGCG ACTCGGTTAG CGGCCTGCGC 
CAGCGTCCCC GTTGCACATA ACCCACGCGC TGAGCCAATC GCCGGACGCG 

GTGCCCGTGC GCACGCGCCC CCCGCGCAAC TAQATTGCAA GAAAAAACTA 
CACGGGCACG CGTGGGCGGG GGGCGCGTTG ATCTAACGTT CTTTTTTGAT 
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CTATGTCCAA GCGCAAAATC AAAGAAGAGA TGCTCCAGGT CATCGCGCCG 
GATACAGGTT CGCGTTTTAG TTTCTTCTCT ACGAGGTCCA GTAGCGCGGC 

GAGATCTATG GCCCCCCGAA GAAGGAAGAG CAGGATTACA AGCCCCGAAA 
CTCTAGATAC CGGGGGGCTT CTTCCTTCTC GTCCTAATGT TCGGGGCTTT 



GCTAAAGCGG GTCAAAAAGA AAAAGAAAGA TGATGATGAT GAACTTGACG 
CGATTTCGCC CAGTTTTTCT TTTTCTTTCT ACTACTACTA CTTGAACTGC 

ACGAGGTGGA ACTGCTGCAC GCTACCGCGC CCAGGCGACG GGTACAGTGG 
TGCTCCACCT TGACGACGTG CGATGGCGCG GGTCCGCTGC CCATGTCACC 

AAAGGTCGAC GCGTAAAACG TGTTTTGCGA CCCGGCACCA CCGTAGTCTT 
TTTCCAGCTG CGCATTTTGC ACAAAACGCT GGGCCGTGGT GGCATCAGAA 

TACGCCCGGT GAGCGCTCCA CCCGCACCTA CAAGCGCGTG TATGATGAGG 
ATGCGGGCCA CTCGCGAGGT GGGCGTGGAT GTTCGCGCAC ATACTACTCC 

TGTACGGCGA CGAGGACCTG CTTGAGCAGG CCAACGAGCG CCTCGGGGAG 
ACATGCCGCT GCTCCTGGAC GAACTCGTCC GGTTGCTCGC GGAGCCCCTC 

TTTGCCTACG GAAAGCGGCA TAAGGACATG CTGGCGTTGC CGCTGGACGA 
AAACGGATGC CTTTCGCCGT ATTCCTGTAC GACCGCAACG GCGACCTGCT 

GGGCAACCCA ACACCTAGCC TAAAGCCCGT AACACTGCAG CAGGTGCTGC 
CCCGTTGGGf TGTGGATCGG ATTTCGGGCA TTGTGACGTC GTCCACGACG 

CC6CGCTTGC ACCGTCCGAA GAAAAGCGCG GCCTAAAGCG CGAGTCTGGT 
GGCGCGAACG TGGCAGGCTT CTTTTCGCGC CGGATTTCGC GCTCAGACCA 

GACTTGGCAC CCACCGTGCA GCTGATGGTA CCCAAGCGCC AGGGACTGGA 
CTGAACCGTG GGTGGCACGT CGACTACCAT GGGTTCGCGG TCGCTGACCT 

AGATGTCTTG GAAAAAATGA. CCGTGGAACC TGGGCTGGAG CCCGAGGTCC 
TCTACAGAAC CTTTTTTACT GGCACCTTGG ACCCGACCTC GGGCTCCAGG 

GCGTGCGGCC AATCAAGCAG GTGGCGCCGG GACTGGGCGT GCAGACCGTG 
CGCACGCCGG TTAGTTCGTC CACCGCGGCC CTGACCCGCA CGTCTGGCAC 

GACGTTCAGA TACCCACTAC CAGTAGCACC AGTATTGCCA CCGCCACAGA 
CTGCAAGTCT ATGGGTGATG GTCATCGTGG TCATAACGGT GGCGGTGTCT 

GGGCATGGAG ACACAAACGT CCCCGGTTGC CTCAGCGGTG GCGGATGCCG 
CCCGTACCTC TGTGTTTGCA GGGGCCAAC6 GAGTCGCCAC CGCCTACGGC 

CGGTGCAGGC GGTCGCTGCG GCCGCGTCCA AGACCTCTAC GGAGGTGCAA 
GCCACGTCCG CCAGCGACGC CGGCGCAGGT TCTGGAGATG CCTCCACGTT 

ACGGACCCGT GGATGTTTCG CGTTTCAGCC CCCCGGCGCC CGCGCCGTTC 
TGCCTGGGCA CCTACAAAGC GCAAAGTCGG GGGGCCGCGG GCGCGGCAAG 

GAGGAAGTAC GGCGCCGCCA GCGCGCTACT GCCCGAATAT GCCCTACATC 
CTCCTTCATG CCGCGGCGGT CGCGCGATGA CGGGCTTATA CGGGATGTAG 
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AGACGAGCAA CTACCCGACG CCGAACCACC ACTGGAACCC GCCGCCGCCG 
TCTGCTCGTT GATGGGCTGC GGCTTGGTGG TGACCTTGGG CGGCGGCGGC 

TCGCCGTCGC CAGCCCGTGC TGGCCCCGAT TTCCGTGCGC AGGGTGGCTC 
AGCGGCAGCG GTCGGGCACG ACCGGGGCTA AAGGCACGCG TCCCACCGAG 



GCGAAGGAGG CAGGACCCTG GTGCTGCCAA CAGCGCGCTA CCACCCCAGC 
CGCTTCCTCC GTCCTGGGAC CACGACGGTT GTCGCGCGAT GGTGGGGTC6 

ATCGTTTAAA AGCCGGTCTT TGTGGTTCTT GCAGATATGG CCCTCACCTG 
TAGCAAATTT TCGGCCAGAA ACACCAAGAA CGTCTATACC GGGAGTGGAC 

CCGCCTCCGT TTCCCGGTGC CGGGATTCCG AGGAAGAATG CACCGTAGGA 
GGCGGAGGCA AAGGGCCACG GCCCTAAGGC TCCTTCTTAC GTGGCATCCT 

GGGGCATGGC CGGCCACGGC CTGACGGGCG GCATGCGTCG TGCGCACCAC 
CCCCGTACCG GCCGGTGCCG GACTGCCCGC CGTACGCAGC ACGCGTGGTG 

CGGCGGCGGC GCGCGTCGCA CCGTCGCATG CGCGGCGGTA TCCTGCCCCT 
GCCGCCGCCG CGCGCAGCGT GGCAGCGTAC GCGCCGCCAT AGGACGGGGA 

CCTTATTCCA CTGATCGCCG CGGCGATTGG CGCCGTGCCC GGAATTGCAT 
GGAATAAGGT GACTAGCGGC GCCGCTAACC GCGGCACGGG CCTTAACGTA 

CCGTGGCCTT GCAGGCGCAG AGACACTGAT TAAAAACAAG TTGCATGTGG 
GGCACCGGAA CGTCCGCGTC TCTGTGACTA ATTTTTGTTC AACGTACACC 

AAAAATCAAA ATAAAAAGTC TGGACTCTCA CGCTCGCTTG GTCCTGTAAC 
TTTTTAGTTT TATTTTTCAG ACCTGAGAGT GCGAGCGAAC CAGGACATTG 

TATTrTGTAG AATGGAAGAC ATCAACTTTG CGTCTCTGGC CCCGCGACAC 
ATAAAACATC TTACCTTCTG TAGTTGAAAC GCAGAGACCG GGGCGCTGTG 

GGCTCGCGCC CGTTCATGGG AAACTGGCAA GATATCGGCA CCAGCAATAT 
CCGAGCGCGG GCAAGTACCC TTTGACCGTT CTATAGCCGT GGTCGTTATA 

GAGCGGTGGC GCCTTCAGCT GGGGCTCGCT GTGGAGCGGC ATTAAAAATT 
CTCGCCACCG CGGAAGTCGA CCCCGAGCGA CACCTCGCCG TAATTTTTAA 

TCGGTTCCAC CGTTAAGAAC TATGGCAGCA AGGCCTGGAA CAGCAGCACA 
AGCCAAGGTG GCAATTCTTG ATACCGTCGT TCCGGACCTT GTCGTC6T6T 

GGCCAGATGC TGAGGGATAA GTTGAAAGAG CAAAATTTCC AACAAAAGGT 
CCGGTCTACG ACTCCCTATT CAACTTTCTC GTTTTAAAGG TTGTTTTCCA 

GGTAGATGGC CTGGCCTCTG GCATTAGCGG GGTGGTGGAC CTGGCCAACC 
CCATCTACCG GACCGGAGAC CGTAATCGCC CCACCACCTG GACCGGTTGG 

AGGCAGTGCA AAATAAGATT AACAGTAAGC TTGATCCCCG CCCTCCCGTA 
TCCGTCACGT TTTATTCTAA TTGTCATTCG AACTAGGGGC GGGAGGGCAT 

GAGGAGCCTC CACCGGCCGT GGAGACAGTG TCTCCAGAGG GGCGTGGCGA 
CTCCTCGGAG 6TGGCCGGCA CCTCT6TCAC AGAGGTCTCC CCGCACCGCT 
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AGCCTCCCTC GTACGAGGAG GCACTAAAGC AAGGCCTGCC CACCACCCGT 
TCGGAGGGAG CATGCTCCTC CGTGATTTCG TTCCGGACGG GTGGTGGGCA 

CCCATCGCGC CCATGGCTAC CGGAGTGCTG GGCCAGCACA CACCCGTAAC 
GGGTAGCGCG GGTACCGATG GCCTCACGAC CCGGTCGTGT GTGGGCATTG 



GCTGGACCTG CCTCCCCCCG CCGACACCCA GCAGAAACCT GTGCTGCCAG 
CGACCTGGAC GGAGGGGGGC G6CTGTGGGT CGTCTTTGGA CACGACGGTC 

GCCCGACCGC CGTTGTTGTA ACCCGTCCTA GCCGCGCGTC CCTGCGCCGC 
CGGGCTGGCG GCAACAACAT TGGGCAGGAT CGGCGCGCAG GGACGCGGCG 

GCCGCCAGCG GTCCGCGATC GTTGCGGCCC GTAGCCAGTG GCAACTGGCA 
CGGCGGTCGC CAGGCGCTAG CAACGCCGGG CATCGGTCAC CGTTGACCGT 

AAGCACACTG AACAGCATCG TGGGTCTGGG GGTGCAATCC CTGAAGCGCC 
TTCGTGTGAC TTGTCGTAGC ACCCAGACCC CCACGTTAGG 6ACTTCGCGG 

GACGATGCTT CTGATAGCTA ACGTGTCGTA TGTGTGTCAT GTATGCGTCC 
CTGCTACGAA GACTATCGAT TGCACAGCAT ACACACA6TA CATACGCAGG 

ATGTCGCCGC CAGAGGAGCT GCTGAGCCGC CGCGCGCCCG CTTTCCAAGA 
TACAGCGGCG GTCTCCTCGA CGACTCGGCG GCGCGCGGGC GAAAGGTTCT 

TGGCTACCCC TTCGATGATG CCGCAGTGGT CTTACATGCA CATCTCGGGC 
ACCGATGGGG AAGCTACTAC GGCGTCACCA GAATGTACGT GTAGAGCCCG 

CAGGACGCCT CGGAGTACCT GAGCCCCGGG CTGGTGCAGT TTGCCCGCGC 
GTCCTGCGGA GCCTCATGGA CTCGGGGCCC GACCACGTCA AACGGGCGCG 

CACCGAGACG TACTTCAGCC TGAATAACAA GTTTAGAAAC CCCACGGTGG 
GTGGCTCTGC ATGAAGTCGG ACTTATTGTT CAAATCTTTG GGGTGCCACC 

CGCCTACGCA CGACGTGACC ACAGACCGGT CCCAGCGTTT GACGCTGCGG 
GCGGATGCGT GCTGCACTGG TGTCTGGCCA GGGTCGCAAA CTGCGACGCC 

TTCATCCCTG TGGACCGTGA GGATACTGCG TACTCGTACA AGGCGCGGTT 
AAGTAGGGAC ACCTGGCACT CCTATGACGC ATGAGCATGT TCCGCGCCAA 

CACCCTAGCT GTGGGTGATA ACCGTGTGCT GGACATGGCT TCCACGTACT 
GTGGGATCGA CACCCACTAT TGGCACACGA CCTGTACCGA AGGTGCATGA 

TTGACATCCG CGGCGTGCTG GACAGGGGCC CTACTTTTAA GCCCTACTCT 
AACTGTAGGC GCCGCACGAC CTGTCCCCGG GATGAAAATT CGGGATGAGA 

GGCACTGCCT ACAACGCCCT GGCTCCCAAG GGTGCCCCAA ATCCTTGCGA 
CCGTGACGGA TGTTGCGGGA CCGAGGGTTC CCACGGGGTT TAGGAACGCT 

ATGGGATGAA GCTGCTACTG CTC7TGAAAT AAACCTAGAA GAAGAGGACG 
TACCCTACTT C6ACGATGAC GAGAACTTTA TTTGGATCTT CTTCTCCTGC 

ATGACAACGA AGACGAAGTA GACGAGCAAG CTGAGCAGCA AAAAACTCAC 
TACTCSTTGCT TCTGCTTCAT CTGCTCGTTC GACTCGTCGT TTTTTGAGTG 
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TCAAATAGGT 6TCGAAGGTC AAACACCTAA ATATGCCGAT AAAACATTTC 
AGTTTATCCA CAGCTTCCAG TTTGTGGATT TATACGGCTA TTTTGTAAAG 

AACCTGAACC TCAAATAGGA GAATCTCAGT GGTACGAAAC AGAAATTAAT 
TTGGACTTGG AGTTTATCCT CTTAGAGTCA CCATGCTTTG TCTTTAATTA 

CATGCAGCTG GGAGAGTCCT AAAAAAGACT ACCCCAATGA AACCATGTTA 
GTACGTCGAC CCTCTCAGGA TTTTTTCTGA TGGGGTTACT TTGGTACAAT 

CGGTTCATAT GCAAAACCCA CAAATGAAAA TGGAGGGCAA GGCATTCTTG 
GCCAAGTATA CGTTTTGGGT GTTTACTTTT ACCTCCCGTT CCGTA.i.GAAC 

TAAAGCAACA AAATGGAAAG CTAGAAAGTC AAGTGGAAAT GCAATTTTTC 
ATTTCGTTGT TTTACCTTTC GATCTTTCAG TTCACCTTTA CGTTAAAAAG 

TCAACTACTG AGGCAGCCGC AGGCAATGGT GATAACTTGA CTCCTAAAGT 
AGTTGATGAC. TCCGTCGGCG TCCGTTACCA CTATTGAACT GAGGATTTCA 

GGTATTGTAC AGTGAAGATG TAGATATAGA AACCCCAGAC ACTCATATTT 
CCATAACATG TCACTTCTAC ATCTATATCT TTGGGGTCTG TGAGTATAAA 

CTTACATGCC CACTATTAAG GAAGGTAACT CACGAGAACT AATGGGCCAA 
GAATGTACGG GTGATAATTC CTTCCATTGA GTCCTCTxisA TTACCCGGTT 

CAATCTATGC CCAACAGGCC TAATTACATT GCTTTTAGGG ACAATTTTAT 
GTTAGATACG GGTTGTCCGG ATTAATGTAA CGAAAATCCC TGTTAAAATA 

TGGTCTAATG TATTACAACA GCACGGGTAA TATGGGTGTT CTGGCGGGCC 
ACCAGATTAC ATAATGTTGT CGTGCCCATT ATACCCACAA GACCGCCCGG 

AAGCATCGCA GTTGAATGCT GTTGTAGATT TGCAAGACAG AAACACAGAG 
TTCGTAGCGT CAACTTACGA CAACATCTAA ACGTTCTGTC TTTGTGTCTC 

CTTTCATACC AGCTTTTGCT TGATTCCATT GGTGATAGAA CCAGGTACTT 
GAAAGTATGG TCGAAAACGA ACTAAGGTAA CCACTATCTT GGTCCATGAA 

TTCTATGTGG AATCAGGCTG TTGACAGCTA TGATCCAGAT GTTAGAATTA 
AAGATACACC TTAGTCCGAC AACTGTCGAT ACTAGGTCTA CAATCTTAAT 

TTGAAAATCA TGGAACTGAA GATGAACTTC CAAATTACTG CTTTCCACTG 
AACTTTTAGT ACCTTGACTT CTACTTGAAG GTTTAATGAC GAAAGGTGAC 

GGAGGTGTGA TTAATACAGA GACTCTTACC AAGGTAAAAC CTAAAACAGG 
CCTCCACACT AATTATGTCT CTGAGAATGG TTCCATTTTG GATTTTGTCC 

TCAGGAAAAT GGATGGGAAA AAGATGCTAC AGAATTT7CA GATAAAAATG 
AGTCCTTTTA CCTACCCTTT TTCTACGATG TCTTAAAAGT CTATTTTTAC 

AAATAAGAGT TGGAAATAAT TTTGCCATGG AAATCAATCT AAATGCCAAC 
TTTATTCTCA ACCTTTATTA AAACGGTACC TTTAGTTAGA TTTACGGTTG 

CTGTGGAGAA ATTTCCTGTA CTCCAACATA GCGCTGTATT TGCCCGACAA 
GACACCTCTT TAAAGGACAT GAGGTTGTAT CGCGACATAA ACGGGCTGTT 
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ACGACTACAT GAACAAGCGA GTGGTG6CTC CCGGGCTAGT GGACTGCTAC 
TGCTGATGTA CTTGTTCGCT CACCACCGAG G6CCCGATCA CCTGACGATG 

ATTAACCTTG GAGCACGCTQ GTCCCTTGAC TATATGGACA ACGTCAACCC 
TAATTGGAAC CTCGTGCGAC CAGGGAACTG ATATACCTGT TGCAGTTGGG 

ATTTAACCAC CACCGCAATG CTGGCCTGCG CTACCGCTCA ATGTTGCTGG 
TAAATTGGTG GTGGCGTTAC GACCGGACGC GATGGCGAGT TACAACGACC 

GCAATGGTCG CTATGTGCCC TTCCACATCC AGGTGCCTCA GAAGTTCTTT 
CGTTACCAGC GATACACGGG AAGGTGTAGG TCCACGGAGT CTTCAAGAAA 

GCCATTAAAA ACCTCCTTCT CCTGCCGGGC TCATACACCT ACGAGTGGAA 
CGGTAATTTT TGGAGGAAGA GGACGGCCCG AGTATGTGGA TGCTCACCTT 

CTTCAGGAAG GATGTTAACA TGGTTCTGCA GAGCTCCCTA GGAAATGACC 
GAAGTCCTTC CTACAATTGT ACCAAGACGT CTCGAGGGAT CCTTTACTGG 

TAAGGGTTGA CGGAGCCAGC ATTAAGTTTG ATAGCATTTG CCTTTACGCC 
ATTCCCAACT GCCTCGGTCG TAATTCAAAC TATCGTAAAC GGAAATGCGG 

ACCTTCTTCC CCATGGCCCA CAACACCGCC TCCACGCTTG AGGCCATGCT 
TGGAAGAAG6 GGTACCGGGT GTTGTGGCGG AGGTGCGAAC TCCGGTACGA 

TAGAAACGAC ACCAACGACC AGTCCTTTAA CGACTATCTC TCCGCCGCCA 
ATCTTTGCTG TGGTTGCTGG TCAGGAAATT GCTGATAGAG AGGCGGCGGT 

ACATGCTCTA CCCTATACCC GCCAACGCTA CCAACGTGCC CATATCCATC 
TGTACGAGAT GGGATATGGG CGGTTGCGAT GGTTGCACGG GTATAGGTAG 

CCCTCCCGCA ACTGGGCGGC TTTCCGCGGC TGGGCCTTCA CGCGCCTTAA 
GGGAGGGCGT TGACCCGCCG AAAGGCGCCG ACCCGGAAGT GCGCGGAATT 

GACTAAGGAA ACCCCATCAC TGGGCTCGGG CTACGACCCT TATTACACCT 
CTGATTCCTT TGGGGTAGTG ACCCGAGCCC GATGCTGGGA ATAATGTGGA 

ACTCTGGCTC TATACCCTAC CTAGATGGAA CCTTTTACCT CAACCACACC 
TGAGACCGAG ATATGGGATG GATCTACCTT GGAAAATGGA GTTGGTGTGG 

TTTAAGAAGG TGGCCATTAC CTTTGACTCT TCTGTCAGCT GGCCTGGCAA 
AAATTCTTCC ACCGGTAATG GAAACTGAGA AGACAGTCGA CCGGACCGTT 

TGACCGCCTG CTTACCCCCA ACGAGTTTGA AATTAAGCGC TCAGTTGACG 
ACTGGCGGAC GAATGGG6GT TGCTGAAACT TTAATTCGCG AGTCAACTGC 

GGGAGGGTTA CAACGTTGCC CAGTGTAACA TGACCAAAGA CTGGTTCCTG 
CCCTCCCAAT GTTGCAACGG GTCACATTGT ACTGGTTTCT GACCAAGGAC 

GTACAAATGC TAGCTAACTA TAACATTGGC TACCAGGGCT TCTATATCCC 
CATGTTTACG ATCGATTGAT ATTGTAACCG ATGGTCCCGA AGATATAGGG 

AGAGAGCTAC AAGGACCGCA TGTACTCCTT CTTTAGAAAC TTCCAGCCCA 
TCTCTCGATG TTCCTGGCGT ACATGAGGAA GAAATCTTTG AAGGTCGGGT 
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GGCATCCTAC ACCAACACAA CAACTCTGGA TTTGTTGGCT ACCTTGCCCC 
CCGTAGGATG TGGTTGTGTT GTTGAGACCT AAACAACCGA TGGAACGGGG 

CACCATGCGC GAAGGACAGG CCTACCCTGC TAACTTCCCC TATCCGCTTA 
GTGGTACGCG CTTCCTGTCC GGATGGGACG ATTGAAGGGG ATAGGCGAAT 

TAGGCAAGAC CGCAGTTGAC AGCATTACCC AGAAAAAGTT TCTTTGCGAT 
ATCCGTTCT6 GCGTCAACTG TCGTAATGGG TCTTTTTCAA AGAAACGCTA 

CGCACCCTTT GGCGCATCCC ATTCTCCAGT AACTTTATGT CCATGGGCGC 
GCGTGGGAAA CCGCGTAGGG TAAGAGGTCA TTGAAATACA GGTACCCGCG 

ACTCACAGAC CTGGGCCAAA ACCTTCTCTA CGCCAACTCC GCCCACGCGC 
TGAGTGTCTG GACCCGGTTT TGGAAGAGAT GCGGTTGAGG CGGGTGCGCG 

TAGACATGAC TTTTGAGGTG GATCCCATGG ACGAGCCCAC CCTTCTTTAT 
ATCTGTACTG AAAACTCCAC CTAGGGTACC TGCTCGGGTG GGAAGAAATA 

GTTTTGTTTG AAGTCTTTGA CGTGGTCCGT GTGCACCAGC CGCACCGCGG 
CAAAACAAAC TTCAGAAACT GCACCAGGCA CACGTGGTCG GCGTGGCGCC 

CGTCATCGAA ACCGTGTACC TGCGCACGCC CTTCTCGGCC GGCAACGCCA 
GCAGTAGCTT TGGCACATGG ACGCGTGCGG GAAGAGCCGG CCGTTGCGGT 

CAACATAAAG AAGCAAGCAA CATCAACAAC AGCTGCCGCC ATGGGCTCCA 
GTTGTATTTC TTCGTTCGTT GTAGTTGTTG TCGACGGCGG TACCCGAGGT 

GTGAGCAGGA ACTGAAAGCC ATTGTCAAAG ATCTTGGTTG TG6GCCATAT 
CACTCGTCCT TGACTTTCGG TAACAGTTTC TAGAACCAAC ACCCGGTATA 

TTTTTGGGCA CCTATGACAA GCGCTTTCCA GGCTTTGTTT CTCCACACAA 
AAAAACCCGT GGATACTGTT CGCGAAAGGT CCGAAACAAA GAGGTGTGTT 

GCTCGCCTGC GCCATAGTCA ATACGGCCGG TCGCGAGACT GGGGGCGTAC 
CGAGCGGACG CGGTATCAGT TATGCCGGCC AGCGCTCTGA CCCCCGCATG 

ACTGGATGGC CTTTGCCTGG AACCCGCACT CAAAAACATG CTACCTCTTT 
TGACCTACCG GAAACGGACC TTGGGCGTGA GTTTTTGTAC GATGGAGAAA 

GAGCCCTTTG GCTTTTCTGA CCAGCGACTC AAGCAGGTTT ACCAGTTTGA 
CTCGGGAAAC CGAAAAGACT GGTCGCTGAG TTCGTCCAAA TGGTCAAACT 

GTACGAGTCA CTCCTGCGCC GTAGCGCCAT TGCTTCTTCC CCCGACCGCT 
CATGCTCA6T GAGGACGCGG CATCGCGGTA AC6AAGAAGG GGGCTGGCGA 

GTATAACGCT GGAAAAGTCC ACCCAAAGCG TACAGGGGCC CAACTCGGCC 
CATATTGCGA CCTTTTCAGG TGGGTTTCGC ATGTCCCCGG GTTGAGCCGG 

GCCTGTGGAC TATTCTGCTG CATGTTTCTC CACGCCTTTG CCAACTGGCC 
CGGACACCTG ATAAGACGAC GTACAAAGAG GTGCGGAAAC GGTTGACCGG 

CCAAACTCCC ATGGATCACA ACCCCACCAT GAACCTTATT ACCGGGGTAC 
GGTTTGAGGG TACCTAGTGT TGGGGTGGTA CTTGGAATAA TGGCCCCATG 
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22801 CAGGAACAGC TCTACAGCTT CCTGGAliL'Ue CAC-iVAyUCv, i /ii. i 

GTCCTTGTCG AGATGTCGAA GGACCTCGCG GTGAGCGGGA TGAAGGCGTC 

22851 CCACAGTGCG CAGATTAGGA GCGCCACTTC TTTTTGTCAC TTGAAAAACA 
GGTGTCACGC GTCTAATCCT CGCGGTGAAG AAAAACAGTG AACTTTTTGT 

22901 TGTAAAAATA ATGTACTAGA GACACTTTCA ATAAAGGCAA ATGCTTTTAT 
ACATTTTTAT TACATGATCT CTGTGAAAGT TATTTCCGTT TACGAAAATA 



22951 TTGTACACTC TCGGGTGATT ATTTACCCCC ACCCTTGCCG TCTGCGCCGT 
AACATGTGAG AGCCCACTAA TAAATGGGGG TGGGAAC6GC AGACGCGGCA 

23001 TTAAAAATCA AAGGGGTTCT 6CCGCGCATC GCTATGCGCC ACTGGCAGGG 
AATTTTTAGT TTCCCCAAGA CGGCGCGTAG CGATACGCGG TGACCGTCCC 

23051 ACACGTTGCG ATACTGGTGT TTAGTGCTCC ACTTAAACTC AGGCACAACC 
TGTGCAACGC TATGACCACA AATCACGAGG TGAATTTGAG TCCGTGTTGG 

23101 ATCCGCGGCA GCTCGGTGAA GTTTTCACTC CACAGGCTGC GCACCATCAC 
TAGGCGCCGT CGAGCCACTT CAAAAGTGAG GTGTCCGACG CGTGGTAGTG 

23151 CAACGCGTTT AGCAGGTCGG GCGCCGATAT CTTGAAGTCG CAGTTGGGGC 
GTTGCGCAAA TCGTCCAGCC CGCGGCTATA GAACTTCAGC GTCAACCCCG 

23201 CTCCGCCCTG CGCGCGCGAG TTGCGATACA CAGGGTTGCA GCACTGGAAC 
GAGGCGGGAC GCGCGCGCTC AACGCTATGT GTCCCAACGT CGTGACCTTG 

23251 ACTATCAGCG CCGGGTGGTG CACGCTGGCC AGCACGCTCT TGTCGGAGAT 
TGATAGTCGC GGCCCACCAC GTGCGACCGG TCGTGCGAGA ACAGCCTCTA 

23301 CAGATCCGCG TCCAGGTCCT CCGCGTTGCT CAGGGCGAAC GGAGTCAACT 
GTCTAGGCGC AGGTCCAGGA GGCGCAACGA GTCCCGCTTG CCTCAGTTGA 

23351 TTGGTAGCTG CCTTCCCAAA AAGGGCGCGT GCCCAGGCTT TGAGTTGCAC 
AACCATCGAC GGAAGGGTTT TTCCCGCGCA CGGGTCCGAA ACTCAACGTG 

23401 TCGCACCGTA GTGGCATCAA AAGGTGACCG TGCCCGGTCT GGGCGTTAGG 
AGCGTGGCAT CACCGTAGTT TTCCACTGGC ACGGGCCAGA CCCGCAATCC 

23451 ATACAGCGCC TGCATAAAAG CCTTGATCTG CTTAAAAGCC ACCTGAGCCT 
TATGTCGCGG ACGTATTTTC GGAACTAGAC GAATTTTCGG TGGACTCGGA 

23501 TTGCGCCTTC AGAGAAGAAC ATGCCGCAAG ACTTGCCGGA AAACTGATTG 
AACGCGGAAG TCTCTTCTTG TACGGCGTTC TGAACGGCCT TTTGACTAAC 

23551 GCCGGACAGG CCGCGTCGTG CACGCAGCAC CTTGCGTCGG TGTTGGAGAT 
CGGCCTGTCC GGCGCAGCAC GTGCGTCGTG GAACGCAGCC ACAACCTCTA 

23601 CTGCACCACA TTTCGGCCCC ACCGGTTCTT CACGATCTTG GCCTTGCTAG 
GACGTGGTGT AAAGCCGGGG TGGCCAAGAA GTGCTAGAAC CGGAAC6ATC 

23651 ACTGCTCCTT CAGCGCGCGC TGCCCGTTTT CGCTCGTCAC ATCCATTTCA 
TGACGAGGAA GTCGCGCGCG ACGGGCAAAA GCGAGCAGTG TAGGTAAAGT 
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ATCACGTGCT CCTTATTTAT CATAATGCTT CCGTGTAGAC ACTTAAGCTC 
TAGTGCACGA GGAATAAATA GTATTACGAA GGCACATCTG TGAATTCGAG 

6CCTTCGATC TCAGCGCAGC GGTGCAGCCA CAACGCGCAG CCCGTGGGCT 
CGGAAGCTA6 AGTCGCGTCG CCACGTCGGT GTTGCGCGTC GGGCACCCGA 

CGTGATGCTT GTAGGTCACC TCTGCAAACG ACTGCAGGTA CGCCTGCAGG 
GCACTACGAA CATCCAGTGG AGACGTTTGC TGACGTCCAT GCGGACGTCC 

AATCGCCCCA TCATCGTCAC AAA6GTCTTG TTGCTGGTGA AGGTCAGCTG 
TTAGCGGGGT AGTAGCAGTG TTTCCAGAAC AACGACCACT TCCAGTCGAC 



CAACCCGCGG TGCTCCTCGT TCAGCCAGGT CTTGCATACG GCCGCCAGAG 
GTTGGGCGCC ACGAGGAGCA AGTCGGTCCA GAACGTATGC CGGCGGTCTC 

CTTCCACTTG GTCAGGCAGT AGTTTGAAGT TCGCCTTTAG ATCGTTATCC 
GAAGGTGAAC CAGTCCGTCA TCAAACTTCA AGCGGAAATC TAGCAATAGG 

ACGTGGTACT TGTCCATCAG CGCGCGCGCA GCCTCCATGC CCTTCTCCCA 
TGCACCATGA ACAGGTAGTC GCGCGCGCGT CGGAGGTACG GGAAGAGGGT 

CGCAGACACG ATCGGCACAC TCAGCGGGTT CATCACCGTA ATTTCACTTT 
GC6TCTGTGC TAGCCGTGTG AGTCGCCCAA 6TAGTGGCAT TAAAGTGAAA 

CCGCTTCGCT GGGCTCTTCC TCTTCCTCTT GCGTCCGCAT ACCACGCGCC 
GGCGAAGCGA CCCGAGAAGG AGAAGGAGAA CGCAGGCGTA TGGTGCGCGG 

ACTGGGTCGT CTTCATTCAG CCGCCGCACT GTGCGCTTAC CTCCTTTGCC 
TGACCCAGCA GAAGTAAGTC GGCGGCGTGA CACGCGAATG GAGGAAACGG 

ATGCTTGATT AGCACCGGTG GGTTGCTGAA ACCCACCATT TGTAGCGCCA 
TACGAACTAA TCGTGGCCAC CCAACGACTT TGGGTGGTAA ACATCGCGGT 

CATCTTCTCT TTCTTCCTCG CTGTCCACGA TTACCTCTGG TGATGGCGGG 
GTAGAAGAGA AAGAAGGAGC GACAGGTGCT AATGGAGACC ACTACCGCCC 

CGCTCGGGCT TGGGAGAAGG GCGCTTCTTT TTCTTCTTGG GCGCAATGGC 
GCGAGCCCGA ACCCTCTTCC CGCGAAGAAA AAGAAGAACC CGCGTTACCG 

CAAATCCGCC GCCGAGGTCG ATGGCCGCGG GCTGGGTGTG CGCGGCACCA 
GTTTAGGCGG CGGCTCCAGC TACCGGCGCC CGACCCACAC GCGCCGTGGT 

GCGCGTCTTG TGATGAGTCT TCCTCGTCCT CGGACTCGAT ACGCCGCCTC 
C6CGCAGAAC ACTACTCAGA AGGAGCAGGA . GCCTGAGCTA T6CGGCGGAG 

ATCCGCTTTT TTGGGGGCGC CCGGGGAGGC GGCGGCGACG GGGACGGGGA 
TAGGCGAAAA AACCCCCGCG GGCCCCTCCG CCGCCGCTGC CCCTGCCCCT 

CGACACGTCC TCCATGGTTG GGGGACGTC6 CGCCGCACCG CGTCCGCGCT 
GCT6TGCAGG AGGTACCAAC CCCCTGCAGC GCGGC6TGGC GCAGGCGCGA 

CGGGGGTGGT TTCGCGCTGC TCCTCTTCCC GACTGGCCAT TTCCTTCTCC 
GCCCCCACCA AAGCGCGACG AGGAGAAGGG CTGACCGGTA AAGGAAGAGG 

TATAGGCAGA AAAAGATCAT GGAGTCAGTC GAGAAGAAGG ACAGCCTAAC 
ATATCCGTCT TTTTCTAGTA CCTCAGTCAG CTCTTCTTCC TGTCGGATTG 
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CTACCACCTT CCCCGTCGAG GCACCCCCGC TTGAC3GAGGA GGAAGTGATT 
GATGGTGGAA GGGGCAGCTC CGTGGGGGC6 AACTCCTCCT CCTTCACTAA 

ATCGAGCAGG ACCCAGGTTT TGTAAGCGAA GACGACGAGG ACCGCTCAGT 
TAGCTCGTCC TGGGTCCAAA ACATTCGCTT CTGCTGCTCC TGGCGAGTCA 

ACCAACAGAG GATAAAAAGC AAGACCAGGA CAACGCAGAG GCAAACGAGG 
TGGTTGTCTC CTATTTTTCG TTCTGGTCCT GTTGCGTCTC CGTTTGCTCC 



AACAAGTCGG GCGGGGGGAC GAAAGGCATG GCGACTACCT AGATGTGGGA 
TTGTTCAGCC CGCCCCCCTG CTTTCCGTAC CGCTGATGGA TCTACACCCT 

GACGACGTGC TGTTGAAGCA TCTGCAGCGC CAGTGCGCCA TTATCTGCGA 
CTGCTGCACG ACAACTTCGT AGACGTCGCG GTCACGCGGT AATAGACGCT 

CGCGTTGCAA GAGCGCAGCG ATGTGCCCCT CGCCATAGCG GATGTCAGCC 
GCGCAACGTT CTCGCGTCGC TACACGGGGA GCGGTATCGC CTACAGTCGG 

TTGCCTACGA ACGCCACCTA TTCTCACCGC GCGTACCCCC CAAACGCCAA 
AACGGATGCT TGCGGTGGAT AAGAGTGGCG CGCATGGGGG GTTTGCGGTT 

GAAAACGGCA CATGCGAGCC CAACCCGCGC CTCAACTTCT ACCCCGTATT 
CTTTTGCCGT GTACGCTCGG GTTGGGCGCG GAGTTGAAGA TGGGGCATAA 

TGCCGTGCCA GAGGTGCTTG CCACCTATCA CATCTTTTTC CAAAACTGCA 
ACGGCACGGT CTCCACGAAC GGTGGATAGT GTAGAAAAAG GTTTTGACGT 

AGATACCCCT ATCCTGCCGT GCCAACCGCA GCCGAGCGGA CAAGCAGCTG 
TCTATGGGGA TAGGACGGCA CGGTTGGCGT CGGCTCGCCT GTTCGTCGAC 

GCCTTGCGGC AGGGCGCTGT CATACCTGAT ATCGCCTCGC TCAACGAAGT 
CGGAACGCCG TCCCGCGACA GTATGGACTA TAGCGGAGCG AGTTGCTTCA 

GCCAAAAATC TTTGAGGGTC TTGGACGCGA CGAGAAGCGC GCGGCAAACG 
CGGTTTTTAG AAACTCCCAG AACCTGCGCT GCTCTTCGCG CGCCGTTTGC 

CTCTGCAACA GGAAAACAGC GAAAATGAAA GTCACTCTGG AGTGTTGGTG 
GAGACGTTGT CCTTTTGTCG CTTTTACTTT CAGTGAGACC TCACAACCAC 

GAACTCGAGG GTGACAACGC GCGCCTAGCC GTACTAAAAC GCAGCATCGA 
CTTGAGCTCC CACTGTTGCG CGCGGATCGG CATGATTTTG CGTCGTAGCT 

GGTCACCCAC TTTGCCTACC CGGCACTTAA CCTACCCCCC AAGGTCAT6A 
CCAGTGGGTG AAACGGATGG GCCGTGAATT GGATGGGGGG TTCCAG7ACT 

GCACAGTCAT GAGTGAGCTG ATCGTGCGCC GTGCGCAGCC CCTGGAGAGG 
CGTGTCAGTA CTCACTCGAC TAGCACGCGG CACGCGTCGG GGACCTCTCC 

GATGCAAATT TGCAAGAACA AACAGAGGAG GGCCTACCCG CAGTTGGCGA 
CTACGTTTAA ACGTTCTTGT TTGTCTCCTC CCGGATGGGC GTCAACCGCT 

CGAGCAGCTA GCGCGCTGGC TTCAAACGCG CGAGCCTGCC GACTTGGAGG 
GCTCGTCGAT CGCGCGACCG AAGTTTGCGC GCTCGGACGG CTGAACCTCC 
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TGCATGCAGC GGTTCTTTGC TGACCCGGAG ATGCAGCGCA AGCTAGAGGA 
ACGTACGTCG CCAAGAAACG ACTGGGCCTC TACGTCGCGT TCGATCTCCT 

AACATTGCAC TACACCTTTC GACAGGGCTA CGTACGCCAG GCCTGCAAGA 
TTGTAACGTG ATGTGGAAAG CTGTCCCGAT GCATGC6GTC CGGACGTTCT 

TCTCCAACGT GGAGCTCTGC AACCTGGTCT CCTACCTTGG AATTTTGCAC 
AGAGGTTGCA CCTCGAGACG TTGGACCAGA GGATGGAACC TTAAAACGTG 



GAAAACCGCC TTGGGCAAAA CGTGCTTCAT TCCACGCTCA AGGGCGAGGC 
CTTTTGGCGG AACCCGTTTT GCACGAAGTA AGGTGCGAGT TCCCGCTCCG 

GCGCCGCGAC TACGTCCGCG ACTGCGTTTA CTTATTTCTA TGCTACACCT 
CGCGGCGCTG ATGCAGGCGC TGACGCAAAT GAATAAAGAT ACGATGTGGA 

GGCAGACGGC CATGGGCGTT TGGCAGCAGT GCTTGGAGGA GTGCAACCTC 
CCGTCTGCCG GTACCCGCAA ACCGTCGTCA CGAACCTCCT CACGTTGGAG 

AAGGAGCTGC AGAAACTGCT AAAGCAAAAC TTGAAGGACC TATGGACGGC 
TTCCTCGACG TCTTTGACGA TTTCGTTTTG AACTTCCTGG ATACCTGCCG 

CTTCAACGAG CGCTCCGTGG CCGCGCACCT GGCGGACATC ATTTTCCCCG 
GAAGTTGCTC GCGAGGCACC GGCGCGTGGA CCGCCTGTAG TAAAAGGGGC 

AACGCCTGCT TAAAACCCTG CAACAGGGTC TGCCAGACTT CACCAGTCAA 
TTGCGGACGA ATTTTGGGAC GTTGTCCCAG ACGGTCTGAA GTGGTCAGTT 

AGCATGTTGC AGAACTTTAG GAACTTTATC CTAGAGCGCT CAGGAATCTT 
TCGTACAACG TCTTGAAATC CTTGAAATAG GATCTCGCGA GTCCTTAGAA 

GCCCGCCACC TGCTGTGCAC TTCCTAGCGA CTTTGTGCCC ATTAAGTACC 
CGGGCGGTGG ACGACACGTG AAGGATCGCT GAAACACGGG TAATTCATGG 

GCGAATGCCC TCCGCCGCTT TGGGGCCACT GCTACCTTCT GCAGCTAGCC 
CGCTTACGGG AGGCGGCGAA ACCCCGGTGA CGATGGAAGA CGTCGATCGG 

AACTACCTTG CCTACCACTC TGACATAATG GAAGACGTGA GCGGTGACGG 
TTGATGGAAC GGATGGTGAG ACTGTATTAC CTTCTGCACT CGCCACTGCC 

TCTACTGGAG TGTCACTGTC GCTGCAACCT ATGCACCCCG CACCGCTCCC 
AGATGACCTC ACAGTGACAG CGACGTTGGA TACGTGG6GC GTGGCGAGGG 

TGGTTTGCAA TTCGCAGCTG CTTAACGAAA GTCAAATTAT CGGTACCTTT 
ACCAAACGTT AAGCGTCGAC GAATTGCTTT CAGTTTAATA GCCATGGAAA 

GAGCTGCAGG GTCCCTCGCC TCACGAAAA6 TCCGCGGCTC CGGGGTTGAA 
CTCGACGTCC CAGGGAGCGG ACTGCTTTTC AGGCGCC6AG GCCCCAACTT 

ACTCACTCCG GGGCTGTGGA CGTCGGCTTA CCTTCGCAAA TTTGTACCTG 
TGAGTGAGGC CCCGACACCT GCAGCCGAAT GGAAGCGTTT AAACATGGAC 

AGGACTACCA CGCCCACGAG ATTAGGTTCT ACGAAGACCA ATCCCGCCCG 
TCCTGATGGT GCGGGTGCTC TAATCCAAGA TGCTTCTGGT TAGGGCGGGC 
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CCAATTGCAA GCCATCAACA AAGCCCGCCA AGAGTTTCTG CTACGAAAGG 
GGTTAACGTT CGGTAGTTGT TTCGGGCGGT TCTCAAAGAC GATGCTTTCC 

GACGGGGGGT TTACTTGGAC CCCCAGTCCG GCGAGGAGCT CAACCCAATC 
CTGCCCCCCA AATGAACCTG GGGGTCAGGC CGCTCCTCGA GTTGGGTTAG 

CCCCCGCCGC CGCAGCCCTA TCAGCAGCAG CCGCGGGCCC TTGCTTCCCA 
GGGGGCGGCG GCGTCGGGAT AGTCGTCGTC GGCGCCCGGG AACGAAGGGT 



GGATGGCACC CAAAAAGAAG CTGCAGCTGC CGCCGCCACC CACGGACGAG 
CCTACCGTGG GTTTTTCTTC GACGTCGACG GCGGCGGTGG GTGCCTGCTC 

GAGGAATACT GGGACAGTCA GGCAGAGGAG GTTTTGGACG AGGAGGAGGA 
CTCCTTATGA CCCTGTCAGT CCGTCTCCTC CAAAACCTGC TCCTCCTCCT 

GGACATGATG GAAGACTGGG AGAGCCTAGA CGAGGAAGCT TCCGAGGTCG 
CCTGTACTAC CTTCTGACCC TCTCGGATCT GCTCCTTCGA AGGCTCCAGC 

AAGAGGTGTC AGACGAAACA CCGTCACCCT CGGTCGCATT CCCCTCGCCG 
TTCTCCACAG TCTGCTTTGT GGCAGTGGGA GCCAGCGTAA GGGGAGCGGC 

GCGCCCCAGA AATCGGCAAC CGGTTCCAGC ATGGCTACAA CCTCCGCTCC 
CGCGGGGTCT TTAGCCGTTG GCCAAGGTCG TACCGATGTT GGAGGCGAGG 

TCAGGCGCCG CCGGCACTGC CCGTTCGCCG ACCCAACCGT AGATGGGACA 
AGTCCGCGGC GGCCGTGACG GGCAAGCGGC TGGGTTGGCA TCTACCCTGT 

CCACTGGAAC CAGGGCCGGT AAGTCCAAGC AGCCGCCGCC GTTAGCCCAA 
GGTGACCTTG GTCCCGGCCA TTCAGGTTCG TCGGCGGCGG CAATCGGGTT 

GAGCAACAAC AGCGCCAAGG CTACCGCTCA TGGCGCGGGC ACAAGAACGC 
CTCGTTGTTG TCGCGGTTCC GATGGCGAGT ACCGCGCCCG TGTTCTTGCG 

CATAGTTGCT TGCTTGCAAG ACTGTGGGGG CAACATCTCC TTCGCCCGCC 
GTATCAACGA ACGAACGTTC TGACACCCCC GTTGTAGAGG AAGCGGGCGG 

GCTTTCTTCT CTACCATCAC GGCGTGGCCT TCCCCCGTAA CATCCTGCAT 
CGAAAGAAGA GATGGTAGTG CCGCACCGGA AGGGGGCATT GTAGGACGTA 

TACTACCGTC ATCTCTACAG CCCATACTGC ACCGGCGGCA 6CGGCAGCAA 
ATGATGGCAG TAGAGATGTC GGGTATGACG TGGCCGCCGT CGCCGTCGTT 

CAGCAGCGGC CACACAGAAG CAAAGGCGAC CGGATAGCAA GACTCTGACA 
GTCGTCGCCG GTGTGTCTTC GTTTCCGCTG GCCTATCGTT CTGAGACTGT 

AAGCCCAAGA AATCCACAGC GGCGGCAGCA GCAGGAGGAG GAGCGCTGCG 
TTCGGGTTCT TTAGGTGTCG CCGCCGTCGT CGTCCTCCTC CTCGCGACGC 

TCTGGCGCCC AACGAACCCG TATCGACCCG CGAGCTTAGA AACAGGATTT 
AGACCGCGGG TTGCTTGGGC ATAGCTGGGC GCTCGAATCT TTGTCCTAAA 

TTCCCACTCT GTATGCTATA TTTCAACAGA GCAGGGGCCA AGAACAAGAG 
AAGGGTGAGA CATACGATAT AAAGTTGTCT CGTCCCCGGT TCTTGTTCTC 




83/144 



wo 02/022080 



PCT/USOl/28861 



27551 TCACAAAAGC GAAGATCAGC TTCGGCGCAC GCTGGAAGAC GCGGAGGCTC 
AGTGTTTTCG CTTCTAGTCG AAGCCGCGTG CGACCTTCTG CGCCTCCGAG 

27601 TCTTCAGTAA ATACTGCGCG CTGACTCTTA AGGACTAGTT TCGCGCCCTT 
AGAAGTCATT TATGACGCGC GACTGAGAAT TCCTGATCAA AGCGCGGGAA 

27651 TCTCAAATTT AAGCGCGAAA ACTACGTCAT CTCCAGCGGC CACACCCGGC 
AGAGTTTAAA TTCGCGCTTT TGATGCAGTA GAGGTCGCCG GTGTGGGCCG 



27701 GCCAGCACCT GTTGTCAGCG CCATTATGAG CAAGGAAATT CCCACGCCCT 
CGGTCGTGGA CAACAGTCGC GGTAATACTC GTTCCTTTAA GGGTGCGGGA 

27751 ACATGTGGAG TTACCAGCCA CAAATGGGAC TTGCGGCTGG AGCTGCCCAA 
TGTACACCTC AATGGTCGGT GTTTACCCTG AACGCCGACC TCGACGGGTT 

27801 GACTACTCAA CCCGAATAAA CTACATGAGC GCGGGACCCC ACATGATATC 
CTGATGAGTT GGGCTTATTT GATGTACTCG CGCCCTGGGG TGTACTATAG 

27851 CCGGGTCAAC GGAATACGCG CCCACCGAAA CCGAATTCTC CTGGAACAGG 
GGCCCAGTTG CCTTATGCGC GGGTGGCTTT GGCTTAAGAG GACCTTGTCC 

27901 CGGCTATTAC CACCACACCT CGTAATAACC TTAATCCCCG TAGTTGGCCC 
GCCGATAATG GTGGTGTGGA GCATTATTGG AATTAGGGGC ATCAACCGGG 

27951 GCTGCCCTGG TGTACCAGGA AAGTCCCGCT CCCACCACTG TGGTACTTCC 
CGAC6GGACC ACATGGTCCT TTCAGGGCGA GGGTGGTGAC ACCATGAAGG 

28001. CAGAGACGCC CAGGCCGAAG TTCAGATGAC TAACTCAGGG GCGCAGCTTG 
GTCTCTGCGG GTCCGGCTTC AAGTCTACTG ATTGAGTCCC CGCGTCGAAC 

28051 CGGGCGGCTT TCGTCACAGG GTGCGGTCGC CCGGGCAGGG TATAACTCAC 
GCCCGCCGAA AGCAGfGTCC CACGCCAGCG GGCCCGTCCC ATATTGAGTG 

28101 CTGACAATCA GAGGGCGAGG TATTCAGCTC AACGACGAGT CGGTGAGCTC 
GACTGTTAGT CTCCCGCTCC ATAAGTCGAG TTGCTGCTCA GCCACTCGAG 

28151 CTCGCTTGGT CTCCGTCCGG ACGGGACATT TCAGATCGGC GGCGCCGGCC 
GAGCGAACCA GAGGCAGGCC TGCCCTGTAA AGTCTAGCCG CCGCGGCCGG 

28201 GCtCTTCATT CACGCCTCGT CAGGCAATCC TAACTCTGCA GACCTCGTCC 
CGAGAAGTAA GTGCGGAGCA GTCCGTTAGG ATTGAGACGT CTGGAGCAGG 

28251 TCTGAGCCGC GCTCTGGAGG CATTGGAACT CTGCAATTTA TTGAGGAGTT 
AGACTCGGCG CGAGACCTCC GTAACCTTGA GACGTTAAAT AACTCCTCAA 

28301 TGTGCCATCG GTCTACTTTA ACCCCTTCTC GGGACCTCCC GGCCACTATC 
ACACGGTAGC CAGATGAAAT TGGGGAAGAG CCCTGGAGGG CCGGT6ATAG 

28351 CGGATCAATT TATTCCTAAC TTTGACGCGG TAAAGGACTC GGCGGACGGC 
GCCTAGTTAA ATAAGGATTG AAACTGCGCC ATTTCCTGAG CCGCCTGCCG 

28401 TACGACTGAA TGTTAAGTGG AGAGGCAGAG CAACTGCGCC TGAAACACCT 
ATGCTGACTT ACAATTCACC TCTCCGTCTC GTTGACGCGG ACTTTGTGGA 
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28451 GGTCCACTGT CGCCGCCACA AGTGCTTTGC CCGCGACTCC GGTGAGTTTT 
CCAGGTGACA GCGGCGGTGT TCACGAAACG GGCGCTGAGG CCACTCAAAA 

28501 GCTACTTTGA ATTGCCCGAG GATCATATCG AGGGCCCGGC GCACGGCGTC 
CGATGAAACT TAACGGGCTC CTAGTATAGC TCCCGGGCCG CGTGCCGCAG 

28551 CGGCTTACCG CCCAGGGAGA GCTTGCCCGT AGCCTGATTC GGGAGTTTAC 
GCCGAATGGC GGGTCCCTCT CGAACGGGCA TCGGACTAAG CCCTCAAATG 

28601 CCAGCGCCCC CTGCTAGTTG AGCGGGACAG GGGACCCTGT GTTCTCACTG 
GGTCGCGGGG GACGATCAAC TCGCCCTGTC CCCTGGGACA CAAGAGTGAC 



28651 TGATTTGCAA CTGTCCTAAC CCTGGATTAC ATCAAGATCT TTGTTGCCAT 
ACTAAACGTT GACAGGATTG GGACCTAATG TAGTTCTAGA AACAACGGTA 

28701 CTCTGTGCTG AGTATAATAA ATACAGAAAT TAAAATATAC TGGGGCTCCT 
GAGACACGAC TCATATTATT TATGTCTTTA ATTTTATATG ACCCCGAGGA 

28751 ATCGCCATCC TGTAAACGCC ACCGTCTTCA CCCGCCCAAG CAAACCAAGG 
TAGCGGTAGG ACATTTGCGG TGGCAGAAGT GGGCGGGTTC GTTTGGTTCC 

28801 CGAACCTTAC CTGGTACTTT TAACATCTCT CCCTCTGTGA TTTACAACAG 
GCTTGGAATG GACCATGAAA ATTGTAGAGA GGGAGACACT AAATGTTGTC 

28851 TTTCAACCCA GACGGAGTGA GTCTACGAGA GAACCTCTCC GAGCTCAGCT 
AAAGTTGGGT CTGCCTCACT CAGATGCTCT CTTGGAGAGG CTCGAGTCGA 

28901 ACTCCATCAG AAAAAACACC ACCCTCCTTA CCTGCCGGGA ACGTACGAGT 
TGAGGTAGTC TTTTTTGTGG TGGGAGGAAT GGACGGCCCT TGCATGCTCA 

28951 GCGTCACCGG CCGCTGCACC ACACCTACCG CCTGACCGTA AACCAGACTT 
CGCAGTGGCC GGCGACGTGG TGTGGATGGC GGACTGGCAT TTGGTCTGAA 

29001 TTTCCGGACA GACCTCAATA ACTCTGTTTA CCAGAACAGG AGGTGAGCTT 
AAAGGCCTGT CTGGAGTTAT TGAGACAAAT GGTCTTGTCC TCCACTCGAA 

29051 AGAAAACCCT TAGGGTATTA GGCCAAAGGC GCAGCTACTG TGGGGTTTAT 
TCTTTTGGGA ATCCCATAAT CCGGTTTCCG CGTCGATGAC ACCCCAAATA 

29101 GAACAATTCA AGCAACTCTA CGGGCTATTC TAATTCAGGT TTCTCTAGAA 
CTTGTTAAGT TCGTTGAGAT GCCCGATAAG ATTAAGTCCA AAGAGATCTT 

29151 TCGGGGTTGG GGTTATTCTC T6TCTTGTGA TTCTCTTTAT TCTTATACTA 
AGCCCCAACC CCAATAAGAG ACAGAACACT AAGAGAAATA AGAATATGAT 

29201 ACGCTTCTCT GCCTAAGGCT CGCCGCCTGC TGTGTGCACA TTTGCATTTA 
TGCGAAGAGA CGGATTCCGA GCGGCGGACG ACACACGTGT AAACGTAAAT 

29251 TTGTCAGCTT TTTAAACGCT GGGGTCGCCA CCCAAGATGA TTAGGTACAT 
AACAGTCGAA AAATTTGCGA CCCCAGCGGT GGGTTCTACT AATCCATGTA 

29301 AATCCTAGGT TTACTCACCC TTGCGTCAGC CCACGGTACC ACCCAAAAGG 
TTAGGATCCA AATGAGTGGG AACGCAGTCG GGTGCCATGG TGGGTTTTCC 

29351 TGGATTTTAA GGAGCCAGCC TGTAATGTTA CATTCGCAGC TGAAGCTAAT 
ACCTAAAATT CCTCGGTCGG ACATTACAAT GTAAGCGTCG ACTTCGATTA 
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29451 TCGCCACAAA AACAAAATTG 6CAAGTATGC TGTTTATGCT ATTTGGCAGC 
AGCGGTGTTT TTGTTTTAAC CGTTCATAC6 ACAAATACGA TAAACCGTCG 

29501 CAGGTGACAC TACAGAGTAT AATGTTACAG TTTTCCAGGG TAAAAGTCAT 
GTCCACTGTG ATGTCTCATA TTACAATGTC AAAAGGTCCC ATTTTCAGTA 

29551 AAAACTTTTA TGTATACTTT TCCATTTTAT GAAATGTGCG ACATTACCAT 
TTTTGAAAAT ACATATGAAA AGGTAAAATA CTTTACACGC TGTAATGGTA 



29601 GTACATGAGC AAACAGTATA AGTTGTGGCC CCCACAAAAT TGTGTGGAAA 
CATGTACTCG TTTGTCATAT TCAACACCGG 6GGTGTTTTA ACACACCTTT 

29651 ACACTGGCAC TTTCTGCTGC ACTGCTATGC TAATTACAGT GCTCGCTTTG 
TGTGACCGTG AAAGACGACG TGACGATACG ATTAATGTCA CGAGCGAAAC 

29701 GTCTGTACCC TACTCTATAT TAAATACAAA AGCAGACGCA GCTTTATTGA 
CAGACATGGG ATGAGATATA ATTTATGTTT TCGTCTGCGT CGAAATAACT 

29751 GGAAAAGAAA ATGCCTTAAT TTACTAAGTT ACAAAGCTAA TGTCACCACT 
CCTTTTCTTT TACGGAATTA AATGATTCAA TGTTTCGATT ACAGTGGTGA 

29801 AACTGCTTTA CTCGCTGCTT GCAAAACAAA TTCAAAAAGT TAGCATTATA 
TTGACGAAAT GAGCGACGAA CGTTTTGTTT AAGTTTTTCA ATGGTAATAT 

29851 ATTAGAATAG GATTTAAACC CCCCGGTCAT TTCCTGCTCA ATACCATTCC 
TAATCTTATC CTAAATTTGG GGGGCCAGTA AAGGACGAGT TATGGTAAGG 

79901 CCTGAACAAT TGACTCTATG TGGGATATGC TCCAGCGCTA CAACCTTGAA 
GGACTTGTTA ACTGAGATAC ACCCTATACG AGGTCGCGAT GTTGGAACTT 

29951 GTCAGGCTTC CTGGATGTCA GCATCTGACT TTGGCCAGCA CCTGTCCCGC 
CAGTCCGAAG GACCTACAGT CGTAGACTGA AACCGGTCGT GGACAGGGCG 

30001 GGATTTGTTC CAGTCCAACT ACAGCGACCC ACCCTAACAG AGATGACCAA 
CCTAAACAAG GTCAGGTTGA - TGTCGCTGGG TGGGATTGTC TCTACTGGTT 

30051 CACAACCAAC GCGGCCGCCG CTACCGGACT TACATCTACC ACAAATACAC 
GTGTTGGTTG CGCCGGCGGC GATGGCCTGA ATGTAGATGG TGTTTATGTG 

30101 CCCAAGTTTC TGCCTTTGTC AATAACTGGG ATAACTTGGG CATGTGGTGG 
GGGTTCAAAG ACGGAAACAG TTATTGACCC TATTGAACCC GTACACCACC 

30151 TTCTCCATAG CGCTTATGTT TGTATGCCTT ATTATTATGT GGCTCATCTG 
AAGAGGTATC GCGAATACAA ACATACGGAA TAATAATACA CCGAGTAGAC 

30201 CTGCCTAAAG CGCAAACGCG CCCGACCACC CATCTATAGT CCCATCATTG 
GACGGATTTC GCGTTTGCGC GGGCTGGTGG GTAGATATCA GGGTAGTAAC 

30251 TGCTACACCC AAACAATGAT GGAATCCATA GATTGGACGG ACTGAAACAC 
ACGATGTGGG TTTGTTACTA CCTTAGGTAT CTAACCTGCC TGACTTTGTG 

30301 ATGTTCTTTT CTCTTACAGT ATGATTAAAT GAGACATGAT TCCTCGAGTT 
TACAAGAAAA GAGAATGTCA TACTAATTTA CTCTGTACTA AGGAGCTCAA 
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30401 TGCGGTTTCT CACATCGAAG TAGACTGCAT TCCAGCCTTC ACAGTCTATT 
ACGCCAAAGA GTGTAGCTTC ATCTGACGTA AGGTCGGAAG TGTCAGATAA 

30451 TGCTTTACGG ATTTGTCACC CTCACGCTCA TCTGCAGCCT CATCACTGTG 
ACGAAATGCC TAAACAGTGG GAGTGCGAGT AGACGTCGGA GTAGTGACAC 

30501 GTCATCGCCT TTATCCAGTG CATTGACTGG GTCTGTGTGC GCTTTGCATA 
CAGTAGCGGA AATAGGTCAC GTAACTGACC CAGACACACG CGAAACGTAT 



.30551 TCTCAGACAC CATCCCCAGT ACAGGGACAG GACTATAGCT GAGCTTCTTA 
AGAGTGTGTG GTAGGGGTCA TGTCCCTGTC CTGATATCGA CTC6AAGAAT 

30601 GAATTCTTTA ATTATGAAAT TTACTGTGAC TTTTCTGCTG ATTATTTGCA 
CTTAAGAAAT TAATACTTTA AATGACACTG AAAAGACGAC TAATAAACGT 

30651 CCCTATCTGC GTTTTGTTCC CCGACCTCCA AGCCTCAAAG ACATATATCA 
GGGATAGACG CAAAACAAGG GGCTGGAGGT TCGGAGTTTC TGTATATAGT 

30701 TGCAGATTCA CTCGTATATG GAATATTCCA AGTTGCTACA ATGAAAAAAG 
ACGTCTAAGT GAGCATATAC CTTATAAGGT TCAACGATGT TACTTTTTTC 

30751 CGATCTTTCC GAAGCCTGGT TATATGCAAT CATCTCTGTT ATGGTGTTCT 
GCTAGAAAGG CTTCGGACCA ATATACGTTA GTAGAGACAA TACCACAAGA 

30801 GCAGTACCAT CTTAGCCCTA GCTATATATC CCTACCTTGA CATTGGCTGG 
CGTCATGGTA GAATCGGGAT CGATATATAG GGATGGAACT GTAACCGACC 

30851 AACGCAATAG ATGCCATGAA CCACCCAACT TTCCCCGCGC CCGCTATGCT 
TTGCGTTATC TACGGTACTT GGTGGGTTGA AAGGGGCGCG GGCGATACGA 

30901 TCCACTGCAA CAAGTTGTTG CCGGCGGCTT TGTCCCAGCC AATCAGCCTC 
AGGTGACGTT GTTCAACAAq GGCCGCCGAA ACAGGGTCGG TTAGTCGGAG 

30951 GCCCACCTTC TCCCACCCCC ACTGAAATCA GCTACTTTAA TCTAACAGGA 
CGGGTGGAAG AGGGTGGGGG TGACTTTAGT CGATGAAATT AGATTGTCCT 

31001 GGAGATGACT GACACCCTAG ATCTAGAAAT GGACGGAATT ATTACAGAGC 
CCTCTACTGA CTGTGGGATC TAGATCTTTA CCTGCCTTAA TAATGTCTCG 

31051 AGCGCCTGCT AGAAAGACGC AGGGCAGCGG CCGAGCAACA GCGCATGAAT 
TCGCGGACGA TCTTTCTGCG TCCCGTCGCC GGCTC6TTGT CGCGTACTTA 

31101 CAAGAGCTCC AAGACATGGT TAACTTGCAC CAGTGCAAAA GGGGTATCTT 
GTTCTCGAGG TTCTGTACCA ATTGAACGTG GTCACGTTTT CCCCATAGAA 

31151 TTGTCTCGTA AAGCAGGCCA AAGTCACCTA CGACAGTAAT ACCACCGGAC 
AACAGAGCAT TTCGTCCGGT TTCAGTGGAT GCTGTCATTA TGGTGGCCTG 

31201 ACCGCCTTAG CTACAAGTTG CCAACCAAGC GTCAGAAATT GGTGGTCATG 
TGGCGGAATC GATGTTCAAC GGTTGGTTCG CAGTCTTTAA CCACCAGTAC 

31251 GTGGGAGAAA AGCCCATTAC CATAACTCAG CACTCGGTAG AAACCGAAGG 
CACCCTCTTT TCGGGTAATG GTATTGAGTC GTGAGCCATC TTTGGCTTCC 
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31351 AGACCCTGTG CGGTCTCAAA GATCTTATTC CCTTTAACTA ATAAAAAAAA 
TCTGGGACAC GCCAGAGTTT CTAGAATAAG GGAAATTGAT TATTTTTTTT 

31401 ATAATAAAGC ATCACTTACT TAAAATCAGT TAGCAAATTT CTGTCCAGTT 
TATTATTTCG TAGTGAATGA ATTTTAGTCA ATCGTTTAAA GACAGGTCAA 

31451 TATTCAGCAG CACCTCCTTG CCCTCCTCCC AGCTCTGGTA TTGCAGCTTC 
ATAAGTCGTC GTGGAGGAAC GGGAGGAGGG TCGAGACCAT AACGTCGAAG 



31501 CTCCTGGCTG CAAACTTTCT CCACAATCTA AATGGAATGT CAGTTTCCTC 
GAGGACCGAC GTTTGAAAGA GGTGTTAGAT TTACCTTACA GTCAAAGGAG 

31551 CTGTTCCTGT CCATCCGCAC CCACTATCTT CATGTTGTTG CAGATGAAGC 
GACAAGGACA GGTPiGOCGTG GGTGATAGAA GTACAACAAC GTCTACTTCG 

31601 GCGCAAGACC GTCTGAAGAT ACCTTCAACC CCGTGTATCC ATATGACACG 
CGCGTTCTGG CAGACTTCTA TGGAAGTTGG GGCACATAGG TATACTGTGC 

31651 GAAACCGGTC CTCCAACTGT GCCTTTTCTT ACTCCTCCCT TTGTATCCCC 
CTTTGGCCAG GAGGTTGACA CGGAAAAGAA TGAGGAGGGA AACATAGGGG 

31701 CAATGGGTTT CAAGAGAGTC CCCCTGGGGT ACTCTCTTTG CGCCTATCCG 
GTTACCCAAA GTTCTCTCAG GGGGACCCCA TGAGAGAAAC GCGGATAGGC 

31751 AACCTCTAGT TACCTCCAAT GGCATGCTTG CGCTCAAAAT GGGCAACGGC 
TTGGAGATCA ATGGAGGTTA CCGTACGAAC GCGAGTTTTA CCCGTTGCCG 

31801 CTCTCTCTGG ACGAGGCCGG CAACCTTACC TCCCAAAATG TAACCACTGT 
GAGAGAGACC TGCTCCGGCC GTTGGAATGG AGGGTTTTAC ATTGGTGACA 

31851 GAGCCCACCT CTCAAAAAAA CCAAGTCAAA CATAAACCTG GAAATATCTG 
CTCGGGTGGA GAGTTTTTTT GGTTCAGTTT GTATTTGGAC CTTTATAGAC 

31901 CACCCCTCAC AGTTACCTCA GAAGCCCTAA CTGTGGCTGC CGCCGCACCT 
GTGGGGAGTG TCAATGGAGT CTTCGGGATT GACACCGACG GCGGCGTGGA 

31951 CTAATGGTCG CGGGCAACAC ACTCACCATG CAATCACAGG CCCCGCTAAC 
GATTACCAGC GCCCGTTGTG TGAGTGGTAC GTTAGTGTCC GGGGCGATTG 

32001 CGTGCACGAC TCCAAACTTA GCATTGCCAC CCAAGGACCC CTCACAGTGT 
GCACGTGCTG AGGTTTGAAT CGTAACGGTG GGTTCCTGGG GAGTGTCACA 

32051 CAGAAGGAAA GCTAGCCCTG CAAACATCAG GCCCCCTCAC CACCACCGAT 
GTCTTCCTTT CGATCGGGAC GTTTGTAGTC CGGGGGAGTG GTGGTGGCTA 

32101 AGCA6TACCC TTACTATCAC TGCCTCACCC CCTCTAACTA CTGCCACTGG 
TCGTCATGG6 AATGATAGTG ACGGAGTGGG GGAGATTGAT GACGGTGACC 

32151 TAGCTTGGGC ATTGACTTGA AAGAGCCCAT TTATACACAA AATGGAAAAC 
ATCGAACCCG TAACTGAACT TTCTCGGGTA AATATGTGTT TTACCTTTTG 

32201 TAGGACTAAA GTACGGGGCT CCTTTGCATG TAACAGACGA CCTAAACACT 
ATCCTGATTT CATGCCCCGA GGAAACGTAC ATTGTCTGCT GGATTTGTGA 
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32301 
32351 
32401 

32451 
32501 
32551 
32601 
32651 
32701 
32751 
32801 
32B51 
32901 
32951 
33001 
33051 
33101 
33151 



• AACTAAAGTT ACTGGAGCCT TGGGTTTTGA TTCACAAGGC AATATGCAAC 
TTGATTTCAA TGACCTCGGA ACCCAAAACT AAGTGTTCCG TTATACGTTG 

TTAATGTAGC AGGAGGACTA AGGATTGATT CTCAAAACAG ACGCCTTATA 
AATTACATCG TCCTCCTGAT TCCTAACTAA GAGTTTTGTC TGCGGAATAT 

CTTGATGTTA GTTATCCGTT TGATGCTCAA AACCAACTAA ATCTAAGACT 
GAACTACAAT CAATAGGCAA ACTACGAGTT TTGGTTGATT TAGATTCTGA 



AGGACAGGGC CCTCTTTTTA TAAACTCAGC CCACAACTTG GATATTAACT 
TCCTGTCCCG GGAGAAAAAT ATTTGAGTCG GGTGTTGAAC CTATAATTGA 

ACAACAAAGG CCTTTACTTG TTTACAGCTT CAAACAATTC CAAAAAGCTT 
TGTTGTTTCC GGAAATGAAC AAATGTCGAA GTTTGTTAAG GTTTTTCGAA 

GAGGTTAACC TAAGCACTGC CAAGGGGTTG ATGTTTGACG CTACAGCCAT 
CTCCAATTGG ATTCGTGACG GTTCCCCAAC TACAAACTGC GATGTCGGTA 

AGCCATTAAT GCAGGAGATG GGCTTGAATT TGGTTCACCT AATGCACCAA 
TCGGTAATTA CGTCCTCTAC CCGAACTTAA ACCAAGTGGA TTACGTG6TT 

ACACAAATCC CCTCAAAACA AAAATTGGCC ATGGCCTAGA ATTTGATTCA 
TGTGTTTAGG. GGAGTTTTGT TTTTAACCGG TACCGGATCT TAAACTAAGT 

AACAAGGCTA TGGTTCCTAA ACTAGGAACT GGCCTTAGTT TTGACAGCAC 
TTGTTCCGAT ACCAAGGATT TGATCCTTGA CCGGAATCAA AACTGTCGTG 

AGGTGCCATT ACAGTAGGAA ACAAAAATAA TGATAAGCTA ACTTTGTGGA 
TCCACGGTAA TGTCATCCTT TGTTTTTATT ACTATTCGAT TGAAACACCT 

CCACACCAGC TCCATCTCCT AACTGTAGAC TAAATGCAGA GAAAGATGCT 
GGTGTGGTCG AGGTAGAGGA TTGACATCTG ATTTACGTCT CTTTCTACGA 

AAACTCACTT TGGTCTTAAC AAAATGTGGC AGTCAAATAC TTGCTACAGT 
TTTGAGTGAA ACCAGAATTG TTTTACACCG TCAGTTTATG AACGATGTCA 

TTCAGTTTTG GCTGTTAAAG GCAGTTTGGC TCCAATATCT GGAACAGTTC 
AAGTCAAAAC CGACAATTTC CGTCAAACCG AGGTTATAGA CCTTGTCAAG 

AAAGTGCTCA TCTTATTATA AGATTTGACG AAAATGGAGT GGTACTAAAC 
TTTCACGAGT AGAATAATAT TCTAAACTGC TTTTACCTCA CGATGATTTG 

AATTCCTTCC TGGACCCAGA ATATTGGAAC TTTAGAAATG GAGATCTTAC 
TTAAGGAAGG ACCTGGGTCT TATAACCTTG AAATCTTTAC CTCTAGAATG 

TGAAGGCACA GCCTATACAA ACGCTGTTGG ATTTATGCCT AACCTATCAG 
ACTTCCGTGT CGGATATGTT TGCGACAACC TAAATACGGA TTGGATAGTC 

CTTATCCAAA ATCTCACGGT AAAACTGCCA AAAGTAACAT TGTCA6TCAA 
6AATAGGTTT TAGAGTGCCA TTTT6ACGGT TTTCATTGTA ACAGTCAGTT 

GTTTACTTAA ACGGAGACAA AACTAAACCT GTAACACTAA CCATTACACT 
CAAATGAATT TGCCTCTGTT TTGATTTGGA CATTGTGATT GGTAATGTGA 
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33251 CATTTTCATG GGACTGGTCT GGCCACAACT ACATTAATGA AATATTTGCC 
GTAAAAGTAC CCTGACCAGA CCGGTGTTGA TGTAATTACT TTATAAACGG 

33301 ACATCCTCTT ACACTTTTTC ATACATTGCC CAAGAATAAA GAATCGTTTG 
TGTAGGAGAA TGTGAAAAAG TATGTAACGG GTTCTTATTT CTTAGCAAAC 

33351 TGTTATGTTT CAACGTGTTT ATTTTTCAAT TGCAGAAAAT TTCAAGTCAT 
ACAATACAAA GTTGCACAAA TAAAAAGTTA ACGTCTTTTA AAGTTCAGTA 



33401 TTTTCATTCA GTAGTATAGC CCCACCACCA CATAGCTTAT ACAGATCACC 
AAAAGTAAGT CATCATATCG GGGTGGTGGT GTATCGAATA TGTCTAGTGG 

33451 GTACCTTAAT CAAACTCACA GAACCCTAGT ATTCAACCT6 CCACCTCCCT 
CATGGAATTA GTTTGAGTGT CTTGGGATCA TAAGTTGGAC GGTGGAGGGA 

33501 CCCAACACAC AGAGTACACA GTCCTTTCTC CCCGGCTGGC CTTAAAAAGC 
GGGTTGTGTG TCTCATGTGT CAGGAAAGAG GGGCCGACCG GAATTTTTCG 

3-^551 ATCATATCAT GGGTAACAGA CATATTCTTA GGTGTTATAT TCCACACGGT 
TAGTATAGTA CCCATTGTCT GTATAAGAAT CCACAATATA AGGTGTGCCA 

33601 TTCCTGTCGA GCCAAACGCT CATCAGTGAT ATTAATAAAC TCCCCGGGCA 
AAGGACAGCT CGGTTTGCGA GTAGTCACTA TAATTATTTG AGGGGCCCGT 

33651 GCTCACTTAA GTTCATGTCG CTGTCCAGCT GCTGAGCCAC AGGCTGCTGT 
CGAGTGAATT CAAGTACAGC GACAGGTCGA CGACTCGGTG TCCGACGACA 

33701 CCAACTTGCG GTTGCTTAAC GGGCGGCGAA GGAGAAGTCC ACGCCTACAT 
GGTTGAACGC CAACGAATTG CCCGCCGCTT CCTCTTCAGG TGCGGATGTA 

33751 GGGGGTAGAG TCATAATCGT GCATCAGGAT AGGGCGGTGG TGCTGCAGCA 
CCCCCATCTC AGTATTAGCA CGTAGTCCTA TCCCGCCACC ACGACGTCGT 

33801 GCGCGCGAAT AAACTGCTGC C6CCGCCGCT CCGTCCTGCA GGAATACAAC 
CGCGCGCTTA TTTGACGACG GCGGCGGCGA GGCAGGACGT CCTTATGTTG 

33851 ATGGCAGTGG TCTCCTCAGC GATGATTCGC ACCGCCCGCA GCATAAGGCG 
TACCGTGACC AGAGGAGTCG CTACTAAGCG TGGCGGGCGT CGTATTCCGC 

33901 CCTTGTCCTC CGGGCACAGC AGCGCACCCT GATCTCACTT AAATCAGCAC 
GGAACAGGAG GCCCGTGTCG TCGCGTGGGA CTAGAGTGAA TTTAGTCGTG 

33951 AGTAACTGCA GCACAGCACC ACAATATTGT TCAAAATCCC ACAGTGCAAG 
TCATTGACGT CGTGTCGTGG TGTTATAACA AGTTTTAGGG TGTCACGTTC 

34001 GCGCTGTATC CAAAGCTCAT GGCGGGGACC ACAGAACCCA CGTGGCCATC 
CGCGACATAG GTTTCGAGTA CCGCCCCTGG TGTCTTGGGT GCACCGGTAG 

34051 ATACCACAAG CGCAGGTAGA TTAAGTGGCG ACCCCTCATA AACACGCTGG 
TATGGTGTTC GCGTCCATCT AATTCACC6C TGGGGAGTAT TTGTGCGACC 

34101 ACATAAACAT TACCTCTTTT GGCATGTTGT AATTCACCAC CTCCCGGTAC 
TGTATTTGTA ATGGAGAAAA CCGTACAACA TTAAGTGGTG GAGGGCCATG 




90/144 



wo 02/022080 



PCT/USOl/28861 



34201 
34251 
34301 

34351 
34401 
34451 
34501 
34551 
34601 
34651 
34701 
34751 
34801 
34B51 
34901 
349SX 
35001 
35051 



GCTGGCCAAA ACCTGCCCGC CGGCTATACA CTGCAGGGAA CCGGGACTGG 
CGACCGGTTT TGGACGGGCG GCCGATATGT GACGTCCCTT GGCCCTGACC 

AACAATGACA GTGGAGAGCC CAGGACTCGT AACCATGGAT CATCATGCTC 
TTGTTACTGT CACCTCTCGG GTCCTGAGCA TTGGTACCTA GTAGTACGAG 

GTCATGATAT CAATGTTGGC ACAACACAGG CACACGTGCA TACACTTCCT 
CAGTACTATA 6TTACAACCG TGTTGTGTCC GTGTGCACGT ATGTGAAGGA 



CAGGATTACA AGCTCCTCCC GCGTTAGAAC CATATCCCA6 6GAACAACCC 
GTCCTAATGT TCGAGGAGGG CGCAATCTTG GTATAGGGTC CCTTGTTGGG 

ATTCCTGAAT CAGCGTAAAT CCCACACTGC AGGGAAGACC TCGCACGTAA 
TAAGGACTTA GTCGCATTTA GGGTGTGACG TCCCTTCTGG AGCGTGCATT 

CTCACGTTGT GCATTGTCAA AGTGTTACAT TCGGGCAGCA GCGGATGATC 
GAGTGCAACA CGTAACAGTT TCACAATGTA AGCCCGTCGT CGCCTACTAG 

CTCCAGTATG GTAGCGCGGG TTTCTGTCTC AAAAGGAGGT AGACGATCCC 
GAGGTCATAC CATCGCGCCC AAAGACAGAG TTTTCCTCCA TCTGCTAGGG 

TACTGTACGG AGTGCGCCGA GACAACCGAG ATCGTGTTGG TCGTAGTGTC 
ATGACATGCC TCACGCGGCT CTGTTGGCTC TAGCACAACC AGCATCACAG 

ATGCCAAATG GAACGCCGGA CGTAGTCATA TTTCCTGAAG CAAAACCAGG 
TACGGTTTAC CTTGCGGCCT GCATCAGTAT AAAGGACTTC GTTTTGGTCC 

TGCGGGCGTG ACAAACAGAT CTGCGTCTCC GGTCTCGCCG CTTAGATCGC 
ACGCCCGCAC TGTTTGTCTA GACGCAGAGG CCAGAGCGGC GAATCTAGCG 

TCTGTGTAGT AGTTGTAGTA TATCCACTCT CTCAAAGCAT CCAGGCGCCC 
AGACACATCA TCAACATCAT ATAGGTGAGA GAGTTTCGTA GGTCCGCGGG 

CCTGGCTTCG GGTTCTATGT AA^CTCCTTC ATGCGCCGCT GCCCTGATAA 
GGACCGAAGC CCAAGATACA TTTGAGGAAG TACGCGGCGA CGGGACTATT 

CATCCACCAC CGCAGAATAA GCCACACCCA GCCAACCTAC ACATTCGTTC 
GTAGGTGGTG 6CGTCTTATT CGGTGTGGGT CGGTTGGATG TGTAAGCAAG 

TGCGAGTCAC ACACGGGAGG AGCGGGAAGA GCTGGAAGAA CCATGTTTrr 
ACGCTCAGTG TGTGCCCTCC TCGCCCTTCT CGACCTTCTT GGTACAAAAA 

TTTTTTATTC CAAAAGATTA TCCAAAACCT CAAAATGAAG ATCTATTAAG 
AAAAAATAAG GTTTTCTAAT AGGTTTTGGA GTTTTACTTC TAGATAATTC 

TGAACGCGCT CCCCTCCGGT GGCGTGGTCA AACTCTACAG CCAAA6AACA 
ACTTGCGCGA GGGGAG6CCA CCGCACCAGT TTGAGATGTC GGTTTCTTGT 

GATAATGGCA TTTGTAAGAT GTTGCACAAT GGCTTCCAAA AGGCAAACGG 
CTATTACCGT AAACATTCTA CAACGTGTTA CGGAAGGTTT TCCGTTTGCC 

CCCTCACGTC CAAGTGGACG TAAAGGCTAA ACCCPTCAGG GTGAATCTCC 
GGGAGTGCAG GTTCACCTGC ATTTCCGATT TGGGAAGTCC CACTTAGAGG 
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35151 CCACCTTCTC AATATATCTC TAAGCAAATC CCGAATATTA AGTCCGGCCA 
GGTGGAAGAG TTATATAGAG ATTCGTTTAG GGCTTATAAT TCAGGCCGGT 

35201 TTGTAAAAAT CTGCTCCAGA GCGCCCTCCA CCTTCAGCCT CAAGCAGCGA 
AACATTTTTA GACGAGGTCT CGCGGGAGGT GGAAGTCGGA GTTCGTCGCT 

35251 ATCATGATTG CAAAAATTCA GGTTCCTCAC AGACCTGTAT AAGATTCAAA 
TAGTACTAAC GTTTTTAAGT CCAAGGAGTG TCTGGACATA TTCTAAGTTT 



35301 AGCGGAACAT TAACAAAAAT ACCGCGATCC CGTAGGTCCC TTCGCAGGGC 
TCGCCTTGTA ATTGTTTTTA TGGCGCTAGG GCATCCAGGG AAGCGTCCCG 

35351 CAGCTGAACA TAATCGTGCA GGTCTGCACG 6ACCAGCGCG GCCACTTCCC 
GTCGACTTGT ATTAGCACGT CCAGACGTGC CTGGTCGCGC CGGTGAAGGG 

35401 CGCCAGGAAC CATGACAAAA GAACCCACAC TGATTATGAC ACGCATACTC 
GCGGTCCTTG GTACTGTTTT CTTGGGTGTG ACTAATACTG TGC6TATGAG 

35451 GGAGCTATGC TAACCAGCGT AGCCCCGATG TAAGCTTGTT GCATGGGCGG 
CCTCGATACG ATTGGTCGCA TCGGGGCTAC ATTCGAACAA CGTACCCGCC 

35501 CGATATAAAA TGCAAGGTGC TGCTCAAAAA ATCAGGCAAA GCCTCGCGCA 
GCTATATTTT ACGTTCCACG ACGAGTTTTT TAGTCCGTTT CGGAGCGCGT 

35551 AAAAAGAAAG CACATCGTAG TCATGCTCAT GCAGATAAAG GCAGGTAAGC 
TTTTTCTTTC GTGTAGCATC AGTACGAGTA CGTCTATTTC CGTCCATTCG 

35601 TCCGGAACCA CCACAGAAAA AGACACCATT TTTCTCTCAA ACATGTCTGC 
AGGCCTTGGT GGTGTCTTTT TCTGTGGTAA AAAGAGAGTT TGTACAGACG 

35651 GGGTTTCTGC ATAAACACAA AATAAAATAA CAAAAAAACA TTTAAACATT 
CCCAAAGACG TATTTGTGTT TTATTTTATT GTTTTTTTGT AAATTTGTAA 

35701 AGAAGCCTGT CTTACAACAG GAAAAACAAC CCTTATAAGC ATAAGACGGA 
TCTTCGGACA GAATGTTGTC CTTTTTGTTG GGAATATTCG TATTCTGCCT 

35751 CTACGGCCAT GCCGGCGTGA CCGTAAAAAA ACTGGTCACC GTGATTAAAA 
GATGCCGGTA CGGCCGCACT GGCATTTTTT TGACCAGTGG CACTAATTTT 

35801 AGCACCACCG ACAGCTCCTC GGTCATGTCC GGAGTCATAA TGTAAGACTC 
TCGTGGTGGC TGTCGAGGAG CCAGTACAGG CCTCAGTATT ACATTCTGAG 

35851 GGTAAACACA TCAGGTTGAT TCACATCGGT CAGTGCTAAA AAGCGACCGA 
CCATTTGTGT AGTCCAACTA AGTGTAGCCA GTCACGATTT TTCGCTGGCT 

35901 AATAGCCCGG GGGAATACAT ACCCGCAGGC GTAGAGACAA CATTACAGCC 
TTATCGGGCC CCCTTATGTA TGGGCGTCCG CATCTCTGTT GTAATGTCGG 

35951 CCCATAGGAG GTATAACAAA ATTAATAGGA GAGAAAAACA CATAAACACC 
GGGTATCCTC CATATTGTTT TAATTATCCT CTCTTTTTGT GTATTTGTGG 

36001 TGAAAAACCC TCCTGCCTAG GCAAAATAGC ACCCTCCCGC TCCAGAACAA 
ACTTTTTGGG AGGACGGATC CGTTTTATCG TGGGAGGGCG AGGTCTTGTT 
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36101 AAAGAAAACC TATTAAAAAA ACACCACTCG ACACGGCACC AGCTCAATCA 
TTTCTTTTGG ATAATTTTTT TGTGGTGAGC TCrGCCGTGG TCGAGTTAGT 

36151 GTCACAGTGT AAAAAAGGGC CAAGTGCAGA GCGAGTATAT ATAGGACTAA 
CAGTGTCACA TTTTTTCCCG GTTCACGTCT CGCTCATATA TATCCTGATT 

36201 AAAATGACGT AACGGTTAAA GTCCACAAAA AACACCCAGA AAACCGCACG 
TTTTACTGCA TTGCCAATTT CAGGTGTTTT TTGTGGGTCT TTTGGCGTGC 



36251 CGAACCTACG CCCAGAAACG AAAGCCAAAA AACCCACAAC TTCCTCAAAT 
GCTTGGATGC GGGTCTTTGC TTTCGGTrTT TTGGGTGTT6 AAGGAGTTTA 

36301 CGTCACTTCC GTTTTCCCAC GTTACGTCAC TTCCCATTTT AAGAAAACTA 
GCAGTGAAGG GAAAAGGGTG CAATGCAGTG AAGGGTAAAA TTCTTTTGAT 

36351 CAATTCCCAA CACATACAAG TTACTCCGCC CTAAAACCTA CGTCACCCGC 
GTTAAGGGTT GTGTATGTTC AATGAGGCGG GATTTTGGAT GCAGTGGGCG 

364 01 CCCGTTCCCA CGCCCCGCGC CACGTCACAA ACTCCACCCC CTCATTATCA 
GGGCAAGGGT GCGGGGCGCG GTGCAGTGTT TGAGGTGGGG GAGTAATAGT 

Pad 



36451 TATTGGCTTC AATCCAAAAT AAGGTATATT ATTGATGATG TTAATTAAGA 
ATAACCGAAG TTAGGTTTTA TTCCATATAA TAACTACTAC AATTAATTCT 

36501 ATTCGGATCT GCGACGCGAG GCTGGATGGC CTTCCCCATT ATGATTCTTC 
TAAGCCTAGA CGCTGCGCTC CGACCTACCG GAAGGGGTAA TACTAAGAAG 

36551 TCGCTTCCGG CGGCATCGGG ATGCCCGCGT TGCAGGCCAT GCTGTCCPiGG 
AGCGAAGGCC GCCGTAGCCC TACGGGCGCA ACGTCCGGTA CGACAGGTCC 

36601 CAGGTAGATG ACGACCATCA 6GGACAGCTT CAAGGCCAGC AAAAGGCCAG 
GTCCATCTAC TGCTGGTAGT CCCTGTCGAA GTTCCGGTCG TTTTCCGGTC 

36651 6AACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC 
CTTGGCATTT TTCCGGCGCA ACGACCGCAA AAAGGTATCC GAGGCGGGGG 

36701 CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG 
GACTGCTCGT AGTGTTTTTA GCTGCGAGTT CAGTCTCCAC CGCTTTGGGC 

36751 ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG 
TGTCCTGATA TTTCTATGGT CCGCAAAGGG GGACCTTCGA GGGAGCACGC 

36801 CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 
GAGAGGACAA GGCTGGGACG GCGAATGGCC TATGGACAGG CGGAAAGAGG 

36851 CTTCGGGAAG CGTGGCGCTT TCTCATAGCT CACGCTGTAG GTATCTCAGT 
GAAGCCCTTC 6CACCGCGAA A6AGTATCGA GTGCGACATC CATAGAGTCA 

36901 TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC TGTGTGCfiCG AACCCCCCGT 
AGCCACATCC AGCAAGCGAG GTTCGACCCG ACACACGTGC TTGGGGGGCA 
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37001 
37051 
37101 
37151 
37201 
37251 
37301 
37351 
37401 
37451 
37501 
37551 
37601 
37651 
37701 
37751 
37801 
37851 



CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT 
6CCATTCTGT GCTGAATAGC GGTGACCGTC GTCGGTGACC ATTGTCCTAA 

AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC 
TCGTCTCGCT CCATACATCC GCCACGATGT CTCAAGAACT TCACCACCGG 

TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA 
ATTGATGCCG ATGTGATCTT CCTGTCATAA ACCATAGACG CGAGACGACT 

AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA 
TCGGTCAATG GAAGCCTTTT TCTCAACCAT CGAGAACTAG GCCGTTTGTT 

ACCACCGCT6 GTAGCGGTGG TT T TTTTGTT TGCAAGCAGC AGATTACGCG 
TGGTGGCGAC CATCGCCACC AAAAAAACAA ACGTTCGTCG TCTAATGCGC 

CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG 
GTCTTTTTTT CCTAGAGTTC TTCTAGGAAA CTAGAAAAGA TGCCCCAGAC 

ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA 
TGCGAGTCAC CTTGCTTTTG AGTGCAATTC CCTAAAACCA GTACTCTAAT 

TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATCAATCTA AAGTATATAT 
AGTTTTTCCT AGAAGTGGAT CTAGGAAAAT TTAGTTAGAT TTCATATATA 

GAGTAAACTT GGTCTGACAG TTACCAATGC TTAATCAGTG AGGCACCTAT 
CTCATTTGAA CCAGACTGTC AATGGTTACG AATTAGTCAC TCCGTGGATA 

CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTGCCTGA CTCCCCGTCG 
GAGTCGCTAG ACAGATAAAG CAAGTAGGTA TCAACGGACT GAGGGGCAGC 

TGTAGATAAC TACGATACGG GAGGGCTTAC CATCTGGCCC CAGTGCTGCA 
ACATCTATTG ATGCTATGCC CTCCCGAATG GTAGACCGGG GTCACGACGT 

ATGATACCGC. GAGACCCACG CTCACCGGCT CCAGATTTAT CAGCAATAAA 
TACTATGGCG CTCTGGGTGC GAGTGGCCGA GGTCTAAATA GTCGTTATTT 

CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG TGGTCCTGCA ACTTTATCCG 
GGTCGGTCGG CCTTCCCGGC TCGCGTCTTC ACCAGGACGT TGAAATAGGC 

CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT AAGTAGTTCG 
GGAGGTAGGT CAGATAATTA ACAACGGCCC TTCGATCTCA TTCATCAAGC 

CCAGTTAATA GTTTGCGCAA CGTTGTTGCC ATTGCTACAG GCATCGTGGT 
GGTCAATTAT CAAACGCGTT GCAACAACGG TAACGAT6TC CGTAGCACCA 

GTCACGCTCG TCGTTTGGTA TGGCTTCATT CAGCTCCGGT TCCCAACGAT 
CAGTGCGAGC AGCAAACCAT ACCGAAGTAA GTCGAGGCCA AGGGTTGCTA 

CAAGGCGAGT TACATGATCC CCCAT6TTGT GCAAAAAAGC GGTTAGCTCC 
GTTCCGCTCA AT6TACTAGG GGGTACAACA CGTTTTTTCG CCAATCGAGG 

TTCGGTCCTC CGATCGTTGT CAGAAGTAAG TTGGCCGCAG TGTTATCACT 
AAGCCAGGAG GCTAGCAACA GTCTTCATTC AACCGGCGTC ACAATAGTGA 
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37951 GATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT CTGAGAATAG 
CTACGAAAAG ACACTGACCA CTCATGAGTT GGTTCAGTAA GACTCTTATC 

38001 TGTATGCGGC GACCGAGTTG CTCTTGCCCG GCGTCAACAC GGGATAATAC 
ACATACGCCG CTGGCTCAAC GAGAACGGGC CGCAGTTGTG CCCTATTATG 

38051 CGCGCCACAT AGCAGAACTT TAAAAGTGCT CATCATTGGA AAACGTTCTT 
GCGCGGTGTA TCGTCTTGAA ATTTTCACGA GTAGTAACCT TTTGCAAGAA 

38101 CGGGGCGAAA ACTCTCAAGG ATCTTACCGC TGTTGAGATC CAGTTCGATG 
GCCCCGCTTT TGAGAGTTCC TAGAATGGCG ACAACTCTAG GTCAAGCTAC 

38151 TAACCCACTC GTGCACCCAA CTGATCTTCA GCATCTTTTA CTTTCACCAG 
ATTGGGTGAG.CACGTGGGTT GACTAGAAGT GGTAGAAAAT GAAAGTGGTC 

38201 CGTTTCTGGG TGAGCAAAAA CAGGAAGGCA AAATGCCGCA AAAAAGGGAA 
GCAAAGACCC ACTCGTTTTT GTCCTTCCGT TTTACGGCGT TTTTTCCCTT 

38251 TAAGGGCGAC ACGGAAATGT TGAATACTCA TACTCTTCCT TTTTCAATAT 
ATTCCCGCTG TGCCTTTACA ACTTATGAGT ATGAGAAGGA AAAAGTTATA 

38301 TATTGAAGCA TTTATCAGGG TTATTGTCTC ATGAGCGGAT ACATATTTGA 
ATAACTTCGT AAATAGTCCC AATAACAGAG TACTCGCCTA TGTATAAACT 

38351 ATGTATTTAG AAAAATAAAC AAATAGGGGT TCCGCGCACA TTTCCCCGAA 
TACATAAATC TTTTTATTTG TTTATCCCCA AGGCGCGTGT AAAGGGGCTT 

38401 AAGTGCCACC TGACGTCTAA GAAACCATTA TTATCATGAC ATTAACCTAT 
TTCACGGTGG ACTGCAGATT CTTTGGTAAT AATAGTACTG TAATTGGATA 

38451 AAAAATAGGC GTATCACGAG GCCCTTTCGT CTTCAAGAAT TGGATCCGAA 
TTTTTATCCG CATAGTGCTC CGGGAAAGCA GAAGTTCTTA ACCTAGGCTT 



Pad 



38501 



TTCTTAATTT CTTAATTAA 
AAGAATTAAA GAATTAATT 



(SEQ ID NO: 32) 
(SEO ID NO: 33) 
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1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 



CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 
GTAGTAGTTA TTATATGGAA TAAAACCTAA CTTCGGTTAT ACTATTACTC 

GGGGTGGA6T TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 
CCCCACCTCA AACACTGCAC CGCGCCCCGC ACCCTTGCCC CGCCCACTGC 

TAGTAGTGTG GCGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 
ATCATCACAC CGCCTTCACA CTACAACGTT CACACCGCCT TGTGTACATT 

GCGACGGATG TGGCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 
CGCTGCCTAC ACCGTTTTCA CTGCAAAAAC CACACGCGGC CACATGTGTC 

GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 
CTTCACTGTT AAAAGCGCGC CAAAATCCGC CTACAACATC ATTTAAACCC 

CGTAACCGAG TAAGATTTGG CCATTTTCGCc GGGAAAACTG AATAAGAGGA 
GCATTGGCTC ATTCTAAACC GGTAAAAGCG CCCTTTTGAC TTATTCTCCT 

AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 
TCACTTTAGA CTTATTAAAA CACAATGAGT ATCGCGCATT ATAAACAGAT 

GGGCCGCGGG GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT 
CCCGGCGCCC CTGAAACTGG CAAATGCACC TCTGAGCGGG TCCACAAAAA 

CTCAGGTGTT TTCCGCGTTC CGGGTCAAAG TTGGCGTTTT ATTATTATAG 
GAGTCCACAA AAGGCGCAAG GCCCAGTTTC AACCGCAAAA TAATAATATC 

GCGGCCGCGA TCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT 
CGCCGGCGCT AGGTAACGTA TGCAACATAG GTATAGTATT ATACATGTAA 

TATATTGGCT CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC 
ATATAACCGA GTACAGGTTG TAATGGCGGT ACAACTGTAA CTAATAACTG 

TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
ATCAATAATT ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT 

TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG 
ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC 

CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 
GGGTTGCTGG GGGCGGGTAA CTGCAGTTAT TACTGCATAC AAGGGTATCA 

AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT 
TTGCGGTTAT CCCTGAAAGG TAACTGCAGT TACCCACCTC ATAAATGCCA 

AAACTGCCCA CTTGGCA6TA CATCAAGTGT ATCATATGCC AAGTACGCCC 
TTTGACGGGT GAACC6TCAT GTAGTTCACA TAGTATACGG TTCATGCGGG 

CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA 
GGATAACTGC AGTTACTGCC ATTTACCGGG CGGACCGTAA TACGGGTCAT 
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901 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA 
AGCGATAATG GTACCACTAC GCCAAAACCG TCATGTAGTT ACCCGCACCT 

951 TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 
ATCGCCAAAC TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT 

1001 TGGGAGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 
ACCCTCAAAC AAAACCGTGG nTTAGTTGC CCTGAAAGGT TTTACAGCAT 

1051 ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT ACG6TGGGAG 
TGTTGAGGCG GGGTAACTGC 6TTTACCCGC CATCCGCACA TGCCACCCTC 

1101 GTCTATATAA GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG 
CAGATATATT CGTCTCGAGC AAATCACTTG GCAGTCTAGC GGACCTCTGC 

1151 CCATCCACGC TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC 
GGTAGGTGCG ACAAAACTGG AGGTATCTTC TGTGGCCCTG GCTAGGTCGG 

1201 TCCGCGGCCG GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT . 
AGGCGCCGGC CCTTGCCACG TAACCTTGCG CCTAAGGGGC ACGGTTCTCA 

1251 GAGATCTGCC ACCATGGCqC GCAAGTGGTC CAAGAGGTCC GTGCCCGGCT 
CTCTAGACGG TGGTACCGGC CGTTCACCAG GTTCTCCAGG CACGGGCCGA 

1301 GGTCCACCGT GAGGGAGAGG ATGAGGAGGG CCGAGCCCGC CGCCGACAGG 
CCAGGTGGCA CTCCCTCTCC TACTCCTCCC GGCTCGGGCG GCGGCTGTCC 

1351 GTGAGGAGGA CCGAGCCCGC CGCAGTGGGC GTGGGCGCCG TGTCCAGGGA 
CACTCCTCCT GGCTCGGGCG GCGTCACCCG CACCCGCGGC ACAGGTCCCT 

1401 CCTGGAGAAG CACGGCGCCA TCACCTCCTC CAACACCGCC GCCACCAACG 
GGACCTCTTC GTGCCGCGGT AGTGGAGGAG GTT6TGGCGG CGGTGGTTGC 

1451 CCGACTGCGC CTGGCTGGAG GCCCAGGAGG ACGAGGAGGT GGGCTTCCCC 
GGCTGACGCG GACCGACCTC CGGGTCCTCC TGCTCCTCCA CCCGAAGGGG 

1501 GTGAGGCCCC AGGTGCCCCT GAGGCCCATG ACCTACAAGG GCGCCGTGGA 
CACTCCGGGG TCCACGGGGA CTCCGGGTAC TpGATGTTCC CGCGGCACCT 

1551 CCTGTCCCAC TTCCTGAAGG AGAAGGGCGG CCTGGAGGGC CTGATCCACT 
GGACAGGGTG AAGGACTTCC TCTTCCCGCC GGACCTCCCG GACTAGGTGA 

1601 CCCAGAAGAG GCAGGACATC CTGGACCTGT GGGTGTACCA CACCCAGGGC 
GGGTCTTCTC CGTCCTGTAG GACCTGGACA CCCACATGGT GTGGGTCCCG 

1651 TACTTCCCCG ACTGGCAGAA CTACACCCCC GGCCCCGGCA TCAGGTTCCC 
ATGAAGGGGC TGACCGTCTT GATGTGGGGG CCGGGGCCGT AGTCCAAGGG 

1701 CCTGACCTTC GGCTGGTGCT TCAAGCTGGT GCCCGTGGAG CCCGAGAAGG 
GGACTGGAAG CCGACCACGA AGTTCGACCA CGGGCACCTC GGGCTCTTCC 

1751 TGGAGGAGGC CAACGAGGGC GAGAACAACT GCGCCGCCCA CCCCATGTCC 
ACCTCCTCCG GTTGCTCCCG CTCTTGTTGA CGCGGCGGGT GGGGTACAGG 
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1851 CTCCAAGCTG GCCTTCCACC ACGTGGCCAG GGAGCTGCAC CCCGAGTACT 
GAGGTTCGAC CGGAAGGTGG TGCACCGGTC CCTCGACGTG GGGCTCATGA 

1901 ACAAGGACTG CTAAAGCCCG GGCAGATCTG CTGTGCCTTC TAGTTGCCAG 
TGTTCCTGAC GATTTCGGGC CCGTCTAGAC GACACGGAAG ATCAACGGTC 

1951 CCATCTGTTG TTTGCCCCTC CCCCGTGCCT TCCTTGACCC TGGAAGGTGC 
GGTAGACAAC AAACGGGGAG GGGGCACGGA AGGAACTGGG ACCTTCCACG 

2001 CACTCCCACT GTCCTTTCCT AATAAAATGA GGAAATTGCA TCGCATTGTC 
GTGAGGGTGA CAGGAAAGGA TTATTTTACT CCTTTAACGT AGCGTAACAG 

2051 TGAGTAGGTG TCATTCTATT CTGGGGGGTG GGGTGGGGCA GGACAGCAAG 
ACTCATCCAC AGTAAGATAA GACCCCCCAC CCCACCCCGT CCTGTCGTTC 

2101 GGGGAGGATT GGGAAGACAA TAGCAGGCAT GCTGGGGATG CGGTGGGCTC 
CCCCTCCTAA CCCTTCTGTT ATCGTCCGTA CGACCCCTAC GCCACCCGAG 

2151 TATGGCCGAT CGGCGCGCCG TACTGAAATG TGTGGGCGTG GCTTAAGGGT 
ATACCGGCTA GCCGCGCGGC ATGACTTTAC ACACCCGCAC CGAATTCCCA 

2201 GGGAAAGAAT ATATAAGGTG GGGGTCTTAT GTAGTTTTGT ATCTGTTTTG 
CCCTTTCTTA TATATTCCAC CCCCAGAATA CATCAAAACA TAGACAAAAC 

2251 CAGCAGCCGC CGCCGCCATG AGCACCAACT CGTTTGATGG AAGCATTGTG 
GTCGTCGGCG GCGGCGGTAC TCGTGGTTGA GCAAACTACC TTCGTAACAC 

2301 A3CTCATATT TGACAACGCG CATGCCCCCA TGGGCCGGGG TGCGTCAGAA 
TCGAGTATAA ACTGTTGCGC GTACGGGGGT ACCCG6CCCC ACGCAGTCTT 

2351 TGTGATGGGC TCCAGCATTG ATGGTCGCCC CGTCCTGCCC GCAAACTCTA 
ACACIACCCG AGGTCGTAAC TACCAGCGGG GCAGGACGGG CGTTTGAGAT 

2401 CTACCTTGAC CTACGAGACC GTGTCTGGAA CGCCGTTGGA GACTGCAGCC 
GATGCSAACTG GATGCTCTGG CACAGACCTT GCGGCAACCT CTGACGTCGG 

2451 TCCGCCGCCG CTTCAGCCGC TGCAGCCACC GCCCGCGGGA TTGTGACTGA 
AGGCGGCGGC GAAGTCGGCG ACGTCGGTGG CGGGCGCCCT AACACTGACT 

2501 CTTTGCTTTC CTGAGCCCGC TTGCAAACAG TGCAGCTTCC CGTTCATCCG 
GAAACGAAAG GACTCGGGCG AACGTTTGTC ACGTCGAAGG GCAAGTAGGC 

2551 CCCGCGATGA CAAGTTGACG GCTCTTTTGG CACAATTGGA TTCTTTGACC 
GGGCGCTACT GTTCAACTGC CGAGAAAACC GTGTTAACCT AAGAAACTGG 

2601 CGGGAACTTA ATGTCGTTTC TCAGCA6CTG TTGGATCTGC GCCAGCAGGT 
GCCCTTGAAT TACAGCAAAG AGTCGTCGAC AACCTAGACG CGGTCGTCCA 

2651 TTCTGCCCTG AAGGCTTCCT CCCCTCCCAA TGCGGTTTAA AACATAAATA 
AAGACGGGAC TTCCGAAGGA GGGGAGGGTT ACGCCAAATT TTGTATTTAT 

2701 AAAAACCAGA CTCTGTTTGG ATTTGGATCA AGCAAGTGTC TTGCTGTCTT 
TTTTTGGTCT GAGACAAACC TAAACCTAGT TCGTTCACAG AACGACAGAA 
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2751 tatttagggg ttttgcgcgc gcggtaggcc cgggaccagc ggtctcggtc 
ataaatcccc aaaacgcgcg cgccatccgg gccctggtcg ccagagccag 

2801 gttgagggtc ctgtgtattt tttccaggac gtggtaaagg tgactctgga 
caactcccag gacacataaa aaaggtcctg caccatttcc actgagacct 

2851 tgttcagata catgggcata agcccgtctc tggggtggag gtagcaccac 
acaagtctat gtacccgtat tcgggcagag accccacctc catcgtggtg 

2901 tgcagagctt catgctgcgg ggtggtgttg tagatgatcc agtcgtagca 
acgtctcgaa gtacgacgcc ccaccacaac atctactagg tcagcatcgt 

2951 ggagcgctgg gcgtggtgcc taaaaatgtc tttcagtagc aagctgattg 
cctcgcgacc cgcaccacgg atttttacag aaagtcatcg ttcgactaac 

3001 ccaggggcag gcccttggtg taagtgttta caaagcisgtt aagctgggat 
ggtccccgtc cgggaaccac attcacaaat gtttcgccaa ttcgacccta 

3051 gggtgcatac gtggggatat gagatgcatc ttggactgta tttttaggtt 
cccacgtatg cacccctata ctctacgtag aacctgacat aaaaatccaa 

3101 ggctatgttc ccagccatat ccctccgggg attcatgttg tgcagaacca 
ccgatacaag ggtcggtata gggaggcccc taagtacaac acgtcttggt 

3151 ccagcacagt gtatccggtg cacttgggaa atttgtcatg tagcttagaa 
ggtcgtgtca cataggccac gtgaaccctt taaacagtac atcgaatctt 

3201 ggaaatgcgt ggaagaactt ggagacgccc ttgtgacctc caagattttc 
cctttacgca ccttcttgaa cctctgcggg aacactggag gttctaaaag 

3251 catgcattcg tccataatga tggcaatggg cccacgggcg gcggcctggg 
gtacgtaagc aggtattact accgttaccc gggtgcccgc cgccggaccc 

3301 cgaagatatt tctgggatca ctaacgtcat agttgtgttc caggatgaga 
gcttctataa agaccctagt gattgcagta tcaacacaag gtcctactct 

3351 tcgtcatagg ccatttttac aaagcgcggg cggagggtgc cagactgcgg 
agcagtatcc 6gtaaaaatg tttcgcgccc gcctcccacg gtctgacgcc 

3401 tat;*j^tggtt ccatccggcc caggggcgta gttaccctca cagatttgca 
atattaccaa ggtaggccgg gtccccgcat caatgggagt gtctaaacgt 

3451 tttcccacgc tttgagttca gatgggggga tcatgtctac ctgcggggcg 
aaagggtgcg aaactcaagt ctacccccct agtacagatg gacgccccgc 

3501 atgaagaaaa cggtttccgg ggtaggggag atcagctggg aagaaagcag 
tacttctttt gccaaaggcc ccatcccctc tagtcgaccc ttctttcgtc 

3551 gttcctgagc agctgcgact taccgcagcc ggtgggcccg taaatcacac 

CAAGGACTCG TCGACGCTGA ATGGCGTCGG CCACCCGGGC ATTTAGTGTG 

3601 CTATTACCGG CTGCAACTGG TAGTTAAGAG AGCTGCAGCT GCCGTCATCC 
GATAATGGCC GACGTTGACC ATCAATTCTC TCGACGTCGA CGGCAGTAGG 

3651 CTGAGCAGGG GGGCCACTTC GTTAAGCATG TCCCTGACTC GCATGTTTTC 
GACTCGTCCC CCCGGTGAAG CAATTCGTAC AGGGACTGAG CGTACAAAAG 
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3701 CCTGACCAAA TCCGCCAGAA GGCGCTCGCC GCCCAGCGAT AGCAGTTCTT 
GGACTGGTTT AGGCGGTCTT CCGCGAGCGG CGGGTCGCTA TCGTCAAGAA 

3751 GCAAGGAAGC AAAGTTTTTC AACGGTTTGA GACCGTCCGC CGTAGGCATG 
CGTTCCTTCG TTTCAAAAAG TTGCCAAACT CTGGCAGGCG GCATCCGTAC 

3801 CTTTTGAGCG TTTGACCAAG CAGTTCCAGG CGGTCCCACA GCTCGGTCAC 
GAAAACTCGC AAACTGGTTC GTCAAGGTCC GCCAGGGTGT C6AGCCAGTG 

3851 CTGCTCTACG GCATCTCGAT CCAGCATATC TCCTCGTTTC GCGGGTTGGG 
GACGAGATGC CGTAGAGCTA GGTCGTATAG AGGAGCAAAG CGCCCAACCC 

3901 GCGGCTTTCG CTGTACGGCA GTAGTCGGTG CTCGTCCAGA CGGGCCAGGG 
CGCCGAAAGC GACATGCCGT CATCAGCCAC GAGCAGGTCT GCCCGGTCCC 

3951 TCATGTCTTT CCACGGGCGC AGGGTCCTCG TCAGCGTAGT CTGGGTCACG 
AGTACAGAAA GGTGCCCGCG TCCCAGGAGC AGTCGCATCA GACCCAGTGC 

4001 GTGAAGGGGT GCGCTCCGGG CTGCGCGCTG GCCA6GGTGC GCTTGAGGCT 
CACTTCCCCA CGCGAGGCCC GACGCGCGAC CGGTCCCACG CGAACTCCGA 

4051 GGTCCTGCTG GTGCTGAAGC GCTGCCGGTC TTCGCCCTGC GCGTCGGCCA 
CCAGGACGAC CACGACTTCG CGACGGCCAG AAGCGGGACG CGCAGCCGGT 

4101 GGTAGCATTT GACCATGGTG TCATAGTCCA GCCCCTCCGC GGCGTGGCCC 
CCATCGTAAA CTGGTACCAC AGTATCAGGT CGGGGAGGCG CCGCACCGGG 

4151 TTGGCGCGCA GCTTGCCCTT GGAGGAGGCG CCGCACGAGG GGCAGTGCAG 
AACCGCGCGT CGAACGGGAA CCTCCTCCGC GGCGTGCTCC CCGTCACGTC 

4201 ACTTTTGAGG GCGTAGAGCT TGGGCGCGAG AAATACCGAT TCCGGGGAGT 
TGAAAACTCC CGCATCTCGA ACCCGCGCTC TTTATGGCTA AGGCCCCTCA 

4251 AGGCATCCGC GCCGCAGGCC CCGCAGACGG TCTCGCATTC CACGAGCCAG . 
TCCGTAGGCG CGGCGTCCGG GGCGTCTGCC AGAGCGTAAG GTGCTCGGTC 

4301 GTGAGCTCTG GCCGTTCGGG GTCAAAAACC AGGTTTCCCC CATGCTTTTT 
CACTCGAGAC CGGCAAGCCC CAGTTTTTGG TCCAAAGGGG GTACGAAAAA 

4351 GATGCGTTTC TTACCTCTGG TTTCCATGAG CCG5TGTCCA CGCTCGGTGA 
CTACGCAAAG AATGGAGACC AAAGGTACTC GGCCACAGGT GCGAGCCACT 

4401 CGAAAAGGCT GTCCGTGTCC CCGTATACAG ACTTGAGAGG CCTGTCCTCG 
GCTTTTCCGA CAGGCACAGG GGCATATGTC TGAACTCTCC GGACAGGAGC 

4451 AGCGGTGTTC CGCGGTCCTC CTCGTATAGA AACTCGGACC ACTCTGAGAC 
TCGCCACAAG GCGCCAGGAG GAGCATATCT TTGAGCCTGG TGAGACTCTG 

4501 AAAGGCTCGC GTCCAGGCCA GCACGAAGGA GGCTAAGTGG GAGGGGTAGC 
TTTCCGAGCG CAGGTCCGGT CGTGCTTCCT CCGATTCACC CTCCCCATCG 

4551 GGTCGTTGTC CACTAGGGGG TCCACTCGCT CCAGGGTGTG AA6ACACATG 
CCAGCAACAG GTGATCCCCC AGGTGAGCGA GGTCCCACAC TTCTGTGTAC 

4601 TCGCCCTCTT CGGCATCAAG GAAGGTGATT GGTTTGTAGG TGTAGGCCAC 
AGCGG6AGAA GCCGTAGTTC CTTCCACTAA CCAAACATCC ACATCCGGTG 
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4701 CGTCCTCACT CTCTTCCGCA TCGCTGTCTG CGAGGGCCAG CTGTTGGGGT 
GCAGGAGTGA GAGAAGGCGT AGCGACAGAC GCTCCCGGTC GACAACCCCA 

47 51 GAGTACrCCC TCTGAAAAGC GGGCATGACT TCTGCGCTAA GATTGTCAGT 
CTCATGAGGG AGACTTTTCG CCCGTACTGA AGACGCGATT CTAACAGTCA 

4801 TTCCAAAAAC GAGGAGGATT TGATATTCAC CTGGCCCGCG GTGATGCCTT 
AAGGTTTTTG CTCCTCCTAA ACTATAAGTG GACCGGGCGC CACTACGGAA 

4851 TGAGGGTGGC CGCATCCATC TGGTCAGAAA AGACAATCTT TTTGTTGTCA 
ACTCCCACCG GCGTAGGTAG ACCAGTCTTT TCTGTTAGAA AAACAACAGT 

4901 AGCTTGGTGG CAAACGACCC GTAGAGGGCG TTGGACAGCA ACTTGGCGAT 
TCGAACCACC GTTTGCTGGG CATCTCCCGC AACCTGTCGT TGAACCGCTA 

4951 GGAGCGCAGG GTTTGGTTTT TGTCGCGATC GGCGCGCTCC TTGGCCGCGA 
CCTCGCGTCC CAAACCAAAA ACAGCGCTAG CCGCGCGAGG AACCGGCGCT 

5001 TGTTTAGCTG CACGTATTCG CGCGCAACGC ACCGCCATTC GGGAAAGACG 
ACAAATCGAC GTGCATAAGC GCGCGTTGCG TGGCGGTAAG CCCTTTCTGC 

5051 GTGGTGCGCT CGTCGGGCAC CAGGTGCACG CGCCAACCGC GGTTGTGCAG 
CACCACGCGA GCAGCCCGTG GTCCACGTGC GCGGTTGGCG CCAACACGTC 

5101 GGTGACAAGG TCAACGCTGG TGGCTACCTC TCCGCGTAGG CGCTCGTTGG 
CCACTGTTCC AGTTGCGACC ACCGATGGAG AGGCGCATCC GCGAGCAACC 

5151 TCCAGCAGAG GCGGCCGCCC TTGCGCGAGC AGAATGGCGG TAGGGGGTCT 
AGGTCGTCTC CGCCGGCGGG AACGCGCTCG TCTTACCGCC ATCCCCCAGA 

5201 AGCTGCGTCT COTCCGGGGG GTCTGCGTCC ACGGTAAAGA CCCCGGGCAG 
TCGACGCAGA GCAGGCCCCC CAGACGCAGG TGCCATTTCT GGGGCCCGTC 

5251 CAGGCGCGCG TCGAAGTAGT CTATCTTGCA TCCTTGCAAG TCTAGCGCCT 
GTCCGCGCGC AGCTTCATCA GATAGAACGT AGGAACGTTC AGATCGCGGA 

5301 GCTGCCATGC GCGGGCGGCA AGCGCGCGCT CGTATGGGTT GAGTGGGGGA 
CGACGGTACG CGCCCGCCGT TCGCGCGCGA GCATACCCAA CTCACCCCCT 

5351 CCCCATGGCA TGGGGTGGGT GAGCGCGGAG GCGTACATGC CGCAAATGTC 
GGGGTACCGT ACCCCACCCA CTCGCGCCTC CGCATGTACG GCGTTTACAG 

5401 GTAAACGTAG AGGGGCTCTC TGAGTATTCC AAGATATGTA GGGTAGCATC 
CATTTGCATC TCCCCGAGA6 ACTCATAAGG TTCTATACAT CCCATCGTAG 

5451 TTCCACCGCG GATGCTGGCG CGCACGTAAT CGTATAGTTC GTGCGAGGGA 
AAGGTGGCGC CTACGACCGC GCGTGCATTA GCATATCAA6 CACGCTCCCT 

5501 GCGAGGAGGT CGGGACCGAG GTTGCTACGG GCGG6CTGCT CTGCTCG6AA 
CGCTCCTCCA GCCCTGGCTC CAACGATGCC CGCCCGACGA GACGAGCCTT 

5551 GACTATCTGC CTGAAGATGG CATGTGAGTT GGATGATATG GTTGGACGCT 
CTGATAGACG GACTTCTACC GTACACTCAA CCTACTATAC CAACCTGCGA 
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56S1 GAGGCGTAGG AGTCGCGCAG CTTGTTGACC AGCTCGGCGG TGACCTGCAC 
CTCCGCATCC TCAGCGCGTC GAACAACTGG TCGAGCCGCC ACTGGACGTG 

5701 GTCTAGGGCG CAGTAGTCCA GGGTTTCCTT GATGATGTCA TACTTATCCT 
CAGATCCCGC GTCATCAGGT CCCAAAGGAA CTACTACAGT ATGAATAGGA 

5751 GTCCCTTTTT TTTCCACAGC TCGCGGTTGA 6GACAAACTC TTCGCGGTCT 
CAGGGAAAAA AAAGGTGTCG AGCGCCAACT CCTGTTTGAG AAGCGCCAGA 

5801 TTCCAGTACT CTTGGATCGG AAACCCGTCG GCCTCCGAAC GGTAAGAGCC 
AAGGTCATGA GAACCTAGCC TTTGGGCAGC CGGAGGCTTG CCATTCTCGG 

5851 TAGCATGTAG AACTGGTTGA CGGCCTGGTA GGCGCAGCAT CCCTTTTCTA 
ATCGTACATC TTGACCAACT GCCGGACCAT CCGCGTCGTA GGGAAAAGAT 

5901 CGGGTAGCGC GTATGCCTGC GCGGCCTTCC GGAGCGAGGT GTGGGTGAGC 
GCCCATCGCG CATACGGACG CGCCGGAAGG CCTCGCTCCA CACCCACTCG 

5951 GCAAAGGTGT CCCTGACCAT GACTTTGAGG TACTGGTATT TGAAGTCAGT 
CGTTTCCACA GGGACTGGTA CTGAAACTCC ATGACCATAA ACTTCAGTCA 

6001 GTCGTCGCAT CCGCCCTGCT CCCAGAGCAA AAAGTCCGTG CGCTTTTTGG 
CAGCAGCGTA GGCGGGACGA GGGTCTCGTT TTTCAGGCAC GCGAAAAACC 

6051 AACGCGGATT TGGCAGGGCG AAGGTGACAT CGTTGAAGAG TATCTTTCCC 
TTGCGCCTAA ACCGTCCCGC TTCCACTGTA GCAACTTCTC ATAGAAAGGG 

6101 GCGCGAGGCA TAAAGTTGCG TGTGATGCGG AAGGGTCCCG GCACCTCGGA 
CGCGCTCCGT ATTTCAACGC ACACTACGCC TTCCCAGGGC CGTGGAGCCT 

6151 ACGGTTGTTA ATTACCTGGG CGGCGAGCAC GATCTCGTCA AAGCCGTTGA 
TGCCAACAAT TAATGGACCC GCCGCTCGTG CTAGAGCAGT TTCGGCAACT 

6201 TGTTGTGGCC CACAATGTAA AGTTCCAAGA AGCGCGGGAT GCCCTTGATG 
ACAACACCGG GTGTTACATT TCAAGGTTCT TCGCGCCCTA CGGGAACTAC 

6251 GAAGGCAATT TTTTAAGTTC CTCGTAGGTG AGCTCTTCAG GGGA6CTGAG 
CTTCCGTTAA AAAATTCAAG GAGCATCCAC TCGAGAAGTC CCCTCGACTC 

6301 CCCGTGCTCT GAAAGGGCCC AGTCTGCAAG ATGAGGGTTG GAAGCGACGA 
GGGCACGAGA CTTTCCCGGG TCAGACGTTC TACTCCCAAC CTTCGCTGCT 

6351 ATGAGCTCCA CAGGTCACGG GCCATTAGCA TTTGCAGGTG GTCGCGAAAG 
TACTCGAGGT GTCCAGTGCC CGGTAATCGT AAACGTCCAC CAGCGCTTTC 

6401 GTCCTAAACT GGCGACCTAT GGCCATTTTT TCTGGGGTGA TGCAGTAGAA 
CAGGATTTGA CCGCTGGATA CCGGTAAAAA AGACCCCACT ACGTCATCTT 

6451 GGTAAGCGGG TCTTGTTCCC AGCGGTCCCA TCCAAGGTTC GCGGCTAGGT 
CCATTCGCCC AGAACAAGGG TCGCCAGGGT AGGTTCCAAG CGCCGATCCA 

6501 CTCGCGCGGC AGTCACTAGA GGCTCATCTC CGCCGAACTT CATGACCAGC 
GAGCGCGCCG TCAGTGATCT CCGAGTAGAG GCGGCTTGAA GTACTGGTCG 
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TACATCGTAG GTGACAAAGA GACGCTCGGT GCGAGGATGC GAGCCGATCG 
ATGTAGCATC CACTGTTTCT CTGCGAGCCA CGCTCCTACG CTCGGCTAGC 

GGAAGAACTG GATCTCCCGC CACCAATTGG AGGAGTGGCT ATTGATGTGG 
CCTTCTTGAC CTAGAGGGCG GTGGTTAACC TCCTCACCGA TAACTACACC 

TGAAAGTAGA AGTCCCTGCG ACGGGCCGAA CACTCGTGCT GGCTTTTGTA 
ACTTTCATCT TCAGGGACGC TGCCCGGCTT GTGAGCACGA CCGAAAACAT 

AAAACGTGCG CAGTACTGGC AGCGGTGCAC GGGCTGTACA TCCTGCACGA 
TTTTGCACGC GTCATGACCG TCGCCACGTG CCCGACATGT AGGACGTGCT 

, GGTTGACCTG ACGACCGCGC ACAAGGAAGC AGAGTGGGAA TTTGAGCCCC 
CCAACTGGAC TGCTGGCGCG TGTTCCTTCG TCTCACCCTT AAACTCGGGG 

TCGCCTGGCG GGTTTGGCTG GTGGTCTTCT ACTTCGGCTG CTTGTCCTTG 
AGCGGACCGC CCAAACCGAC CACCAGAAGA TGAAGCCGAC GAACAGGAAC 

ACCGTCTGGC TGCTCGAGGG GAGTTACGGT GGATCGGACC ACCACGCCGC 
TGGCAGACCG ACGAGCTCCC CTCAATGCCA CCTAGCCTTG TGGTGCGGCG 

GCGAGCCCAA AGTCCAGATG TCCGCGCGCG GCGGTCGGAG CTTGATGACA 
CGCTCGGGTT TCAGGTCTAC AGGCGCGCGC CGCCAGCCTC GAACTACTGT 

ACATCGCGCA GATGGGAGCT GTCCATGGTC TGGAGCTCCC GCGGCGTCAG 
TGTAGCGCGT CTACCCTCGA CAGGTACCAG ACCTCGAGGG CGCCGCAGTC 

G.TCAGGCGGG AGCTCCTGCA GGTTTACCTC GCATAGACGG GTCAGGGCGC 
CAGTCCGCCC TCGAGGACGT CCAAATGGAG CGTATCTGCC CAGTCCCGCG 

GGGCTAGATC CAGGTGATAC CTAATTTCCA GGGGCTGGTT GGTGGCGGCG 
CCCGATCTAG GTCCACTATG GATTAAAGGT CCCCGACCAA CCACCGCCGC 

TCGATGGCTT GCAAGAGGCC GCATCCCCGC GGCGCGACTA CGGTACCGCG 
AGCTACCGAA CGTTCTCCGG CGTAGGGGCG CCGCGCTGAT GCCATGGCGC 

CGGCGGGCGG TGGGCCGCGG GGGTGTCCTT GGATGATGCA TCTAAAAGCG 
GCCGCCCGCC ACCCGGCGCC CCCACAGGAA CCTACTACGT AGATTTTCGC 

GTGACGCGGG. CGAGCCCCCG GAGGTAGGGG GGGCTCCGGA CCCGCCGGGA 
CACTGCGCCC GCTCGGGGGC CTCCATCCCC CCCGAGGCCT GGGCGGCCCT 

GAGGGGGCAG GGGCACGTCG GCGCCGCGCG CGGGCAGGAG CTGGTGCTGC 
CTCCCCCGTC CCCGTGCAGC CGCGGCGCGC GCCCGTCCTC GACCACGACG 

GCGCGTAGGT TGCTGGCGAA CGCGACGACG CGGCGGTTGA TCTCCTGAAT 
CGCGCATCCA ACGACCGCTT GCGCTGCTGC GCCGCCAACT AGAGGACTTA 

CTGGCGCCTC. TGC6TGAAGA CGACGGGCCC GGTGAGCTTG AACCTGAAAG 
GACCGCGGAG ACGCACTTCT GCTGCCCGGG CCACTCGAAC TTGGACTTTC 

AGAGTTCGAC AGAATCAATT TCGGTGTCGT TGACGGCGGC CTGGCGCAAA 
TCTCAAGCTG TCTTAGTTAA AGCCACAGCA ACTGCCGCCG GACCGCGTTT 




103/144 



wo 02/022080 



PCT/USOl/28861 



7551 CTGCTCGATC TCTTCCTCCT GGAGATCTCC GCGTCCGGCT CGCTCCACGG 
GACGAGCTAG AGAAGGAGGA CCTCTAGAGG CGCAGGCCGA GCGAGGTGCC 

7601 TGGCGGCGAG GTCGTTGGAA ATGCGGGCCA TGAGCTGCGA GAAGGCGTTG 
ACCGCCGCTC CAGCAACCTT TACGCCCGGT ACTCGACGCT CTTCCGCAAC 

7651 AGGCCTCCCT CGTTCCAGAC GCGGCTGTAG ACCACGCCCC CTTCGGCATC 
TCCGGAGGGA GCAAGGTCTG CGCCGACATC TGGTGCGGGG GAAGCCGTAG 

7701 GCGGGCGCGC ATGACCACCT GCGCGAGATT GAGCTCCACG TGCCGGGCGA 
CGCCCGCGCG TACTGGTGGA CGCGCTCTAA CTCGAGGTGC ACGGCCCGCT 

7751 AGACGGCGTA GTTTCGCAGG CGCTGAAAGA GGTAGTTGAG GGTGGTGGCG 
TCTGCCGCAT CAAAGCGTCC GCGACTTTCT CCATCAACTC CCACCACCGC 

7801 GTGTGTTCTG CCACGAAGAA GTACATAACC CAGCGTCGCA ACGTGGATTC 
CACACAAGAC GGTGCTTCTT CATGTATTGG GTCGCAGCGT TGCACCTAAG 

7851 GTTGATATCC CCCAAGGCCT CAAGGCGCTC CATGGCCTCG TAGAAGTCCA 
CAACTATAGG GGGTTCCGGA GTTCCGCGAG GTACCGGAGC ATCTTCAGGT 

7901 CGGCGAAGTT GAAAAACTGG GAGTTGCGCG CCGACACGGT TAACTCCTCC 
GCCGCTTCAA CTTTTTGACC CTCAACGCGC GGCTGTGCCA ATTGAGGAGG 

7951 TCCAGAAGAC GGATGAGCTC GGCGACAGTG TCGCGCACCT CGCGCTCAAA 
AGGTCTTCTG CCTACTCGAG CCGCTGTCAC AGCGCGTGGA GCGCGAGTTT 

8001 GGCTACAGGG GCCTCTTCTT CTTCTTCAAT CTCCTCTTCC ATAAGGGCCT 
CCGATGTCCC CGGAGAAGAA GAAGAAGTTA GAGGAGAAGG TATTCCCGGA 

8051 CCCCTTCTTC TTCTTCTGGC GGCGGTGGGG GAGGGGGGAC ACGGCGGCGA 
GGGGAAGAAG AAGAAGACCG CCGCCACCCC CTCCCCCCTG .TGCCGCCGCT 

8101 CGACGGCGCA CCGGGAGGCG GTCGACAAAG CGCTCGATCA TCTCCCCGCG 
GCTGCCGCGT GGCCCTCCGC CAGCTGTTTC GCGAGCTAGT AGAGGGGCGC 

8151 GCGACGGCGC ATGGTCTCGG TGACGGCGCG GCCGTTCTCG CGGGGGCGCA 
CGCTGCCGCG TACCAGAGCC ACTGCCGCGC CGGCAAGAGC GCCCCCGCGT 

8201 GTTGGAAGAC GCCGCCCGTC ATGTCCCGGT TATGGGTTGG CGGGGGGCTG 
CAACCTTCTG CGGCGGGCAG TACAGGGCCA ATACCCAACC GCCCCCCGAC 

8251 CCATGCGGCA GGGATACGGC GCTAACGATG CATCTCAACA ATTGTTGTGT 
GGTACGCCGT CCCTATGCCG CGATTGCTAC GTAGAGTTGT TAACAACACA 

8301 AGGTACTCCG CCGCCGAGGG ACCTGAGCGA GTCCGCATCG ACCGGATCGG 
TCCATGAGGC GGCGGCTCCC TGGACTCGCT CAGGCGTAGC TGGCCTAGCC 

8351 AAAACCTCTC GAGAAAGGCG TCTAACCAGT CACAGTCGCA AGGTAGGCTG 
TTTTGGAGAG CTCTTTCCGC AGATTGGTCA GTGTCAGCGT TCCATCCGAC 

8401 AGCACCGTGG CGGGCGGCAG CGGGCGGCGG TCGGGGTTGT TTCTGGCGGA 
TCGTGGCACC GCCCGCCGTC GCCCGCCGCC AGCCCCAACA AAGACCGCCT 
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8501 TCGACAGAAG CACCATGTCC TTGGGTCCGG CCTGCTGAAT GCGCAGGCGG 
A6CTGTCTTC GTGGTACAGG AACCCAGGCC GGACGACTTA CGCGTCCGCC 

8551 TCGGCCATGC CCCAGGCTTC GTTTTGACAT CGGCGCAGGT CTTTGTAGTA 
AGCCGGTACG GGGTCCGAAG CAAAACTGTA GCCGCGTCCA GAAACATCAT 

8601 6TCTTGCATG AGCCTTTCTA CCGGCACTTC TTCTTCTCCT TCCTCTTGTC 
CAGAACGTAC TCGGAAAGAT GGCCGTGAAG AAGAAGAGGA AGGAGAACAG 

8651 CTGCATCTCT TGCATCTATC GCTGCGGCGG CGGCGGAGTT TGGCCGTAGG 
GACGTAGAGA ACGTAGATAG CGACGCCGCC GCCGCCTCAA ACCGGCATCC 

8701 TGGCGCCCTC TTCCTCCCAT GCGTGTGACC CCGAAGCCCC TCATCGGCTG 
ACCGCGGGAG AAGGAGGGTA CGCACACTGG GGCTTCGGGG AGTAGCCGAC 

8751 AAGCAGGGCT AGGTCGGCGA CAACGCGCTC GGCTAATATG GCCTGCTGCA 
TTCGTCCCGA TCCAGCCGCT GTTGCGCGAG CCGATTATAC CGGACGACGT 

8801 CCTGCGTGAG GGTAGACTGG AAGTCATCCA TGTCCACAAA GCGGTGGTAT 
GGACGCACTC CCATCTGACC TTCAGTAGGT ACAGGTGTTT CGCCACCATA 

8851 GCGCCCGTGT TGATGGTGTA AGTGCAGTTG GCCATAACGG ACCAGTTAAC 
CGCGGGCACA ACTACCACAT TCACGTCAAC CGGTATTGCC TGGTCAATTG 

8901 GGTCTGGTGA CCCGGCTGCG AGAGCTCGGT GTACCTGAGA CGCGAGTAAG 
CCAGACCACT GGGCCGACGC TCTCGAGCCA CATGGACTCT GCGCTCATTC 

8951 CCCTCGAGTC AAATACGTAG TCGTTGCAAG TCCGCACCAG GTACTGGTAT 
GGGAGCTCAG TTTATGCATC AGCAACGTTC AGGCGTGGTC CATGACCATA 

9001 CCCACCAAAA AGTGCGGCGG CGGCTGGCGG TAGAGGGGCC AGCGTAGGGT 
GGGTGGTTTT TCACGCCGCC GCCGACCGCC ATCTCCCCGG TCGCATCCCA 

9051 GGCCGGGGCT CCGGGGGCGA GATCTTCCAA CATAAGGCGA TGATATCCGT 
CCGGCCCCGA GGCCCCCGCT CTAGAAGGTT GTATTCCGCT ACTATAGGCA 

9101 AGATGTACCT GGACATCCAG GTGATGCCGG CGGCGGTGGT GGAGGCGCGC 
TCTACATGGA CCTGTAGGTC CACTACGGCC GCCGCCACCA CCTCCGCGCG 

9151 GGAAAGTCGC GGACGCGGTT CCAGATGTTG CGCAGCGGCA AAAAGTGCTC 
CCTTTCAGCG CCTGCGCCAA GGTCTACAAC GCGTCGCCGT TTTTCACGAG 

9201 CATGGTCGGG ACGCTCTGGC CGGTCAGGCG CGCGCAATCG TTGACGCTCT. 
GTACCAGCCC TGCGAGACCG GCCAGTCCGC GCGCGTTAiSC AACTGCGAGA 

9251 AGACCGTGCA AAAGGAGAGC CTGTAAGCGG GCACTCTTCC GTGGTCTGGT 
TCTGGCACGT TTTCCTCTCG GACATTCGCC CGTGAGAAGG CACCAGACCA 

9301 GGATAAATTC GCAAGGGTAT CATGGCGGAC GACCGGGGTT CGAGCCCCGT 
CCTATTTAAG CGTTCCCATA GTACCGCCTG CTGGCCCCAA GCTCGGGGCA 

9351 ATCCGGCCGT CCGCCGTGAT CCATGCGGTT ACCGCCCGCG TGTCGAACCC 
TAGGCCGGCA GGCGGCACTA GGTACGCCAA TGGCGGGCGC ACAGCTTGGG 
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GGCGCGGCGG CTGCTGCGCT AGCTTTTTTG GCCACTGGCC GCGCGCAGCG 
CCGCGCCGCC GACGACGCGA TCGAAAAAAC CGGTGACCGG CGCGCGTCGC 

TAAGCGGTTA GGCTGGAAAG CGAAAGCATT AAGTGGCTCG CTCCCTGTAG 
ATTCGCCAAT CCGACCTTTC GCTTTCGTAA TTCACCGAGC GAGGGACATC 

CCGGAGGGTT ATTTTCCAAG GGTTGAGTCG CGGGACCCCC GGTTCGAGTC 
GGCCTCCCAA TAAAAGGTTC CCAACTCAGC GCCCTGGGGG CCAAGCTCAG 

TCGGACCGGC CGGACTGCGG CGAACGGGGG TTTGCCTCCC CGTCATGCAA 
AGCCTGGCCG GCCTGACGCC GCTTGCCCCC AAACGGAGGG GCAGTACGTT 

GACCCCGCTT GCAAATTCCT CCGGAAACAG GGACGAGCCC CTTTTTTGCT 
CTGGGGCGAA CGTTTAAGGA GGCCTTTGTC CCTGCTCGGG GAAAAAACGA 

TTTCCCAGAT GCATCCGGTG CTGCGGCAGA TGCGCCCCCC TCCTCAGCAG 
AAAGGGTCTA CGTAGGCCAC GACGCCGTCT ACGCGGGGGG AGGAGTCGTC 

CGGCAAGAGC AAGAGCAGCG GCAGACATGC AGGGCACCCT CCCCTCCTCC 
GCCGTTCTCG TTCTCGTCGC CGTCTGTACG TCCCGTGGGA GGGGAGGAGG 

TACCGCGTCA GGAGGGGCGA CATCCGCGGT TGACGCGGCA GCAGATGGTG 
ATGGCGCAGT CCTCCCCGCT GTAGGCGCCA ACTGCGCCGT CGTCTACCAC 

ATTACGAACC CCCGCGGCGC CGGGCCCGGC ACTACCTGGA CTTGGAGGAG 
TAATGCTTGG GGGCGCCGCG GCCCGGGCCG TGATGGACCT GAACCTCCTC 

GGCGAGGGCC TGGCGCGGCT AGGAGCGCCC TCTCCTGAGC GGCACCCAAG 
CCGCTCCCGG ACCGCGCCGA TCCTCGCGGG AGAGGACTCG CCGTGGGTTC 

GGTGCAGCTG AAGCGTGATA CGCGTGAGGC GTACGTGCCG CGGCAGAACC 
CCACGTCGAC TTCGCACTAT GCGCACTCCG CATGCACGGC GCCGTCTTGG 

TGTTTCGCGA CCGCGAGGGA GAGGAGCCCG AGGAGATGCG GGATCGAAAG 
ACAAAGCGCT GGCGCTCCCT CTCCTCGGGC TCCTCTACGC CCTAGCTTTC 

TTCCACGCAG GGCGCGAGCT GCGGCATGGC CTGAATCGCG AGCGGTTGCT 
AAGGTGCGTC CCGCGCTCGA CGCCGTACCG GACTTAGCGC TCGCCAACGA 

GCGCGAGGAG GACTTTGAGC CCGACGCGCG AACCGGGATT AGTCCCGCGC 
CGCGCTCCTC CTGAAACTCG GGCTGCGCGC TTGGCCCTAA TCAGGGCGCG 

6CGCACAC6T GGCGGCCGCC GACCTGGTAA CCGCATACGA GCAGACGGTG 
CGCGTGTGCA CCGCCGGCGG CTGGACCATT GGCGTATGCT CGTCTGCCAC 

AACCAGGAGA TTAACTTTCA AAAAAGCTTT AACAACCACG TGCGTACGCT 
TTGGTCCTCT AATTGAAAGT TTTTTCGAAA TTGTTG6TGC ACGCATGCGA 

TGTGGCGCGC GAGGAGGTGG CTATAGGACT GATGCATCTG TGGGACTTTG 
ACACCGCGCG CTCCTCCACC GATATCCTGA CTACGTAGAC ACCCTGAAAC 

TAAGCGCGCT GGAGCAAAAC CCAAATAGCA AGCCGCTCAT GGCGCAGCTG 
ATTCGCGCGA CCTCGTTTTG GGTTTATCGT TCGGCGAGTA CCGCGTCGAC 
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10401 GCTAAACATA GTAGAGCCCG AGGGCCGCTG GCTGCTCGAT TTGATAAACA 
CGATTTGTAT CATCTCGGGC TCCCGGCGAC CGACGAGCTA AACTATTTGT 

10451 TCCTGCAGAG CATAGTGGTG CAGGAGCGCA GCTTGAGCCT GGCTGACAAG 
AGGACGTCTC GTATCACCAC GTCCTCGCGT CGAACTCGGA CCGACTGTTC 

10501 GTGGCCGCCA TCAACTATTC CATGCTTAGC CTGGGCAAGT TTTACGCCCG 
CACCGGCGGT AGTTGATAAG GTACGAATCG GACCCGTTCA AAATGCGGGC 

10551 CAAGATATAC CATACCCCTT ACGTTCCCAT AGACAAGGAG GTAAAGATCG 
GTTC.TATATG GTATGGGGAA TGCAAGGGTA TCTGTTCCTC CATTTCTAGC 

10601 AGGGGTTCTA CATGCGCATG GCGCTGAAGG TGCTTACCTT GAGCGACGAC 
TCCCCAAGAT GTACGCGTAC CGCGACTTCC ACGAATGGAA CTCGCTGCTG 

10651 CTGGGCGTTT ATCGCAACGA GCGCATCCAC AAGGCCGTGA GCGTGAGCCG 
GACCCGCAAA TAGCGTTGCT CGCGTAGGTG TTCCGGCACT CGCACTCGGC 

10701 GCGGCGCGAG CTCAGCGACC GCGAGCTGAT GCACAGCCTG CAAAGGGCCC 
CGCCGCGCTC GAGTCGCTGG CGCTCGACTA CGTGTCGGAC GTTTCCCGGG 

10751 TGGCTGGCAC GGGCAGCGGC GATAGAGAGG CCGAGTCCTA CTTTGACGCG 
ACCGACCGTG CCCGTCGCCG CTATCTCTCC GGCTCAGGAT GAAACTGCGC 

10801 GGCGCTGACC TGCGCTGGGC CCCAAGCCGA CGCGCCCTGG AGGCAGCTGG 
CCGCGACTGG ACGCGACCCG GGGTTCGGCT GCGCGGGACC TCCGTCGACC 

10851 GGCCGGACCT GGGCTGGCGG TGGCACCCGC GCGCGCTGGC AACGTCGGCG 
CCGGCCTGGA CCCGACCGCC ACCGTGGGCG CGCGCGACCG TTGCAGCCGC 

10901 GCGTGGAGGA ATATGACGAG GACGATGAGT ACGAGCCAGA GGACGGCGAG 
CGCACCTCCT TATACTGCTC CTGCTACTCA TGCTCGGTCT CCTGCCGCTC 

10951 TACTAAGCGG TGATGTTTCT GATCAGATGA TGCAAGACGC AACGGACCCG 
ATGATTCGCC ACTACAAAGA CTAGTCTACT ACGTTCTGCG TTGCCTGGGC 

11001 GCGGTGCGGG CGGCGCTGCA GAGCCAGCCG TCCGGCCTTA ACTCCACGGA 
CGCCACGCCC GCCGCGACGT CTCGGTCGGC AGGCCGGAAT TGAGGTGCCT 

11051 CGACTGGCGC CAGGTCATGG ACCGCATCAT GTCGCTGACT GCGCGCAATC 
GCTGACCGCG GTCCAGTACC TGGCGTAGTA CAGCGACTGA CGCGCGTTAG 

11101 CTGACGCGTT CCGGCAGCAG CCGCAGGCCA ACCGGCTCTC CGCAATTCTG 
GACTGCGCAA GGCCGTCGTC GGCGTCCGGT TGGCCGAGAG GCGTTAAGAC 

11151 GAAGCGGTGG TCCCGGCGCG CGCAAACCCC ACGCACGAGA AGGTGCTGGC 
CTTCGCCACC AGGGCCGCGC GCGTTTGGGG TGCGTGCTCT TCCACGACCG 

11201 GATCGTAAAC GCGCTGGCCG AAAACAGGGC CATCCGGCCC GACGAGGCCG 
CTAGCATTTG CGCGACCGGC TTTTGTCCCG GTAGGCCGGG CTGCTCCGGC 

11251 GCCTGGTCTA CGACGCGCTG CTTCAGCGCG TGGCTCGTTA CAACAGCGGC 
CGGACCAGAT GCTGCGCGAC GAAGTCGCGC ACCGAGCAAT GTTGTCGCCG 
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GGCGCAGCGT GAGCGCGCGC AGCAGCAGGG CAACCTGGGC TCCATGGTTG 
CCGCGTCGCA CTCGCGCGCG TCGTCGTCCC GTTGGACCCG AGGTACCAAC 

CACTAAACGC CTTCCTGAGT ACACAGCCCG CCAACGTGCC GCGGGGACAG 
GTGATTTGCG GAAGGACTCA TGTGTCGGGC GGTTGCACGG CGCCCCTGTC 

GAGGACTACA CCAACTTTGT GAGCGCACTG CGGCTAATGG TGACTGAGAC 
CTCCTGATGT GGTTGAAACA CTCGCGTGAC GCCGATTACC ACTGACTCTG 

ACCGCAAAGT GAGGTGTACC AGTCTGGGCC AGACTATTTT TTCCAGACCA 
TGGCGTTTCA CTCCACATGG TCAGACCCGG TCTGATAAAA AAGGTCTGGT 

GTA6ACAAGG CCTGCAGACC GTAAACCTGA GCCAGGCTTT CAAAAACTTG 
CATCTGTTCC GGACGTCTGG CATTTGGACT CGGTCCGAAA GTTTTTGAAC 

CAGGGGCTGT GGGGGGTGCG GGCTCCCACA GGCGACCGCG CGACCGTGTC 
GTCCCCGACA CCCCCCACGC CCGAGGGTGT CCGCTGGCGC GCTGGCACAG 

TAGCTTGCTG ACGCCCAACT CGCGCCTGTT GCTGCTGCTA ATAGCGCCCT 
ATCGAACGAC TGCGGGTTGA GCGCGGACAA CGACGACGAT TATCGCGGGA 

TCACGGACAG TGGCAGCGTG TCCCGGGACA CATACCTAGG TCACTTGCTG 
AGTGCCTGTC ACCGTCGCAC AGGGCCCTGT GTATGGATCC AGTGAACGAC 

ACACTGTACC GCGAGGCCAT AGGTCAGGCG CATGTGGACG AGCATACTTT 
TGTGACATGG CGCTCCGGTA TCCAGTCCGC GTACACCTGC TCGTATGAAA 

CCAGGAGATT ACAAGTGTCA GCCGCGCGCT GGGGCAGGAG GACACGGGCA 
GGTCCTCTAA TGTTCACAGT CGGCGCGCGA CCCCGTCCTC CTGTGCCCGT 

GCCTGGAGGC AACCCTAAAC TACCTGCTGA CCAACCGGCG GCAGAAGATC 
CGGACCTCCG TTGGGATTTG ATGGACGACT GGTTGGCCGC CGTCTTCTAG 

CCCTCGTTGC ACAGTTTAAA CAGCGAGGAG GAGCGCATTT TGCGCTACGT 
GGGAGCAACG TGTCAAATTT GTCGCTCCTC CTCGCGTAAA ACGCGATGCA 

GCAGCAGAGC GTGAGCCTTA ACCTGATGCG CGACGGGGTA ACGCCCAGCG 
CGTCGTCTCG CACTCGGAAT TGGACTACGC GCTGCCCCAT TGCGGGTCGC 

TGGCGCTGGA CATGACCGCG CGCAACATGG AACCGGGCAT GTATGCCTCA 
ACCGCGACCT GTACTGGCGC GCGTTGTACC TTGGCCCGTA CATACGGAGT 

AACCGGCCGT TTATCAACCG CCTAATGGAC TACTTGCATC GCGCGGCCGC 
TTGGCCGGCA AATAGTTGGC GGATTACCTG ATGAACGTAG CGCGCCGGCG 

CGTGAACCCC GAGTATTTCA CCAATGCCAT CTTGAACCCG CACTGGCTAC 
GCACTTGGGG CTCATAAAGT GGTTACGGTA GAACTTGGGC GTGACCGATG 

CGCCCCCTGG TTTCTACACC GGGGGATTCG AGGTGCCCGA GGGTAACGAT 
GCGGGGGACC AAAGATGTGG CCCCCTAAGC TCCACGGGCT CCCATTGCTA 

6GATTCCTCT GGGACGACAT AGACGACAGC GTGTTTTCCC CGCAACCGCA 
CCTAAGGAGA CCCTGCTGtA TCTGCTGTCG CACAAAAGGG GCGTTGGCGT 
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AGGAAAGCTT CCGCAGGCCA AGCAGCTTGT CCGATCTAGG CGCTGCGGCC 
TCCTTTCGAA GGCGTCCGGT TCGTCGAACA GGCTAGATCC GCGACGCCGG 

CCGCGGTCAG ATGCTAGTAG CCCATTTCCA AGCTTGATAG GGTCTCTTAC 
GGCGCCAGTC TACGATCATC GGGTAAAGGT TCGAACTATC CCAGAGAATG 

CAGCACTCGC ACCACCCGCC CGCGCCTGCT GGGCGAGGAG GAGTACCTAA 
GTCGTGAGCG TGGTGGGCGG GCGCGGACGA CCCGCTCCTC CTCATGGATT 

ACAACTCGCT GCTGCAGCCG CAGCGCGAAA AAAACCTGCC TCCGGCATTT 
TGTTGAGCGA CGACGTCGGC GTCGCGCTTT TTTTGGACGG AGGCCGTAAA 

CCCAACAACG GGATA6AGAG CCTAGTGGAC AAGATGAGTA GATGGAAGAC 
GGGTTGTTGC CCTATCTCTC GGATCACCTG TTCTACTCAT CTACCTTCTG 

GTACGCGCAG GAGCACAGGG ACGTGCCAGG CCCGCGCCCG CCCACCCGTC 
CATGCGCGTC CTCGTGTCCC TGCACGGTCC GGGCGCGGGC GGGTGGGCAG 

GTCAAAGGCA CGACCGTCAG CGGGGTCTGG TGTGGGAGGA CGATGACTCG 
CAGTTTCCGT GCTGGCAGTC GCCCCAGACC ACACCCTCCT GCTACTGAGC 

GCAGACGACA GCAGCGTCCT GGATTTGGGA GGGAGTGGCA ACCCGTTTGC 
CGTCTGCTGT CGTCGCAGGA CCTAAACCCT CCCTCACCGT TGGGCAAACG 

GCACCTTCGC CCCAGGCTGG GGAGAATGTT TTAAAAAAAA AAAAAGCATG 
CGTGGAAGCG GGGTCCGACC CCTCTTACAA AATTTTTTTT TTTTTCGTAC 

ATGCAAAATA AAAAACTCAC CAAGGCCATG GCACCGAGCG TTGGTTTTCT 
TACGTTTTAT TTTTTGAGTG GTTCCGGTAC CGTGGCTCGC AACCAAAAGA 

TGTATTCCCC TTAGTATGCG GCGCGCGGCG ATGTATGAGG AAGGTCCTCC 
ACATAAGGGG AATCATACGC CGCGCGCCGC TACATACTCC TTCCAGGAGG 

TCCCTCCTAC GAGAGTGTGG TGAGCGCGGC GCCAGTGGCG GCGGCGCTGG 
AGGGAGGATG CTCTCACACC ACTCGCGCCG CGGTCACCGC CGCCGCGACC 

GTTCTCCCTT CGATGCTCCC CTGGACCCGC CGTTTGTGCC TCCGCGGTAC 
CAAGAGGGAA GCTACGAGGG GACCTGGGCG GCAAACACGG AGGCGCCATG 

CTGCGGCCTA CCGGGGGGAG AAACAGCATC CGTTACTCTG AGTTGGCACC 
GACGCCGGAT GGCCCCCCTC TTTGTCGTAG GCAATGAGAC TCAACCGTGG 

CCTATTCGAC ACCACCCGTG TGTACCTGGT GGACAACAAG TCAACGGATG 
GGATAAGCTG TGGTGGGCAC ACATGGACCA CCTGTTGTTC AGTTGCCTAC 

TGGCATCCCT GAACTACCAG AACGACCACA GCAACTTTCT GACCACGGTC 
ACCGTAGGGA CTTGATGGTC TTGCTGGTGT CGTTGAAAGA CTGGTGCCAG 

ATTCAAAACA ATGACTACAG CCGGGGGGAG GCAAGCACAC AGACCATCAA 
TAAGTTTTGT TACTGATGTC GGGCCCCCTC CGTTCGTGTG TCTGGTAGTT 

TCTTGACGAC CGGTCGCACT GGGGCGGCGA CCTGAAAACC ATCCTGCATA 
AGAACTGCTG GCCAGCGTGA CCCCGCCGCT GGACTTTTGG TAGGACGTAT 
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CGGGTGATGG TGTCGCGCTT GCCTACTAAG GACAATCAGG TGGAGCT6AA 
GCCCACTACC ACAGCGCGAA CGGATGATTC CTGTTAGTCC ACCTCGACTT 

ATACGAGTGG GTGGAGTTCA CGCTGCCCGA GGGCAACTAC TCCGAGACCA 
TATGCTCACC CACCTCAAGT GCGACGGGCT CCCGTTGATG AGGCTCTGGT 

TGACCATAGA CCTTATGAAC AACGCGATCG TGGAGCACTA CTTGAAAGTG 
ACTGGTATCT GGAATACTTG TTGCGCTAGC ACCTCGTGAT GAACTTTCAC 

GGCAGACAGA ACGGGGTTCT GGAAAGCGAC ATCGGGGTAA AGTTTGACAC 
CCGTCTGTCT TGCCCCAAGA CCTTTCGCTG TAGCCCCATT TCAAACTGTG 

CCGCAACTTC AGACTGGGGT TTGACCCCGT CACTGGTCTT GTCATGCCTG 
GGCGTTGAAG TCTGACCCCA AACTGGGGCA GTGACCAGAA CAGTACGGAC 

GGGTATATAC AAACGAAGCC TTCCATCCAG ACATCATTTT GCTGCCAGGA 
CCCATATATG TTTGCTTCGG AAGGTAGGTC TGTAGTAAAA CGACGGTCCT 

TGCGGGGTGG ACTTCACCCA CAGCCGCCTG AGCAACTTGT TGGGCATCCG 
ACGCCCCACC TGAAGTGGGT GTCGGCGGAC TCGTTGAACA ACCCGTAGGC 

CAAGCGGCAA CCCTTCCAGG AGGGCTTTAG GATCACCTAC GATGATCTGG 
GTTCGCCGTT GGGAAGGTCC TCCCGAAATC CTAGTGGATG CTACTAGACC 

AGGGTGGTAA CATTCCCGCA CTGTTGGATG TGGACGCCTA CCAGGCGAGC 
TCCCACCATT GTAAGGGCGT GACAACCTAC ACCTGCGGAT GGTCCGCTCG 

TTGAAAGATG ACACCGAACA GGGCGGGGGT GGCGCAGGCG GCAGCAACAG 
AACTTTCTAC TGTGGCTTGT CCCGCCCCCA CCGCGTCCGC CGTCGTTGTC 

CAGTGGCAGC GGCGCGGAAG AGAACTCCAA CGCGGCAGCC GCGGCAATGC 
GTCACCGTCG CCGCGCCTTC TCTTGAGGTT GCGCCGTCGG CGCCGTTACG 

AGCCGGTGGA GGACATGAAC GATCATGCCA TTCGCGGCGA CACCTTTGCC 
TCGGCCACCT CCTGTACTTG CTAGTACGGT AAGCGCCGCT GTGGAAACGG 

ACACGGGCTG AGGAGAAGCG CGCTGAGGCC GAAGCAGCGG CCGAAGCTGC 
TGTGCCCGAC TCCTCTTCGC GCGACTCCGG CTTCGTCGCC GGCTTCGACG 

CGCCCCCGCT GCGCAACCCG AGGTCGAGAA GCCTCAGAAG AAACCGGTGA 
GCGGGGGCGA CGCGTTGGGC TCCAGCTCTT CGGAGTCTTC TTTGGCCACT 

TCAAACCCCT GACAGAGGAC AGCAAGAAAC GCAGTTACAA CCTAATAAGC 
AGTTTGGGGA CTGTCTCCTG TCGTTCTTTG C6TCAATGTT GGATTATTCG 

AATGACAGCA CCTTCACCCA GTACCGCAGC TGGTACCTTG CATACAACTA 
TTACTGTCGT GGAAGTGGGT CATGGCGTCG ACCATGGAAC GTATGTTGAT 

CGGCGACCCT CAGACCGGAA TCCGCTCATG GACCCTGCTT TGCACTCCTG 
GCCGCTGGGA GTCTGGCCTT AGGCGAGTAC CTGGGACGAA ACGTGAGGAC 

ACGTAACCTG CGGCTCGGAG CAGGTCTACT GGTCGTTGCC AGACATGATG 
TGCATTGGAC GCCGAGCCTC GTCCAGATGA CCAGCAACGG TCTGTACTAC 
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14201 GGTGGGCGCC GAGCT6TTGC CCGTGCACTC CAAGAGCTTC TACAACGACC 
CCACCCGCGG CTC6ACAACG GGCAC6TGAG GTTCTCGAAG ATGTTGCT6G 

14251 AGGCCGTCTA CTCCCAACTC ATCCGCCAGT TTACCTCTCT GACCCACGTG 
TCCGGCAGAT GAGGGTTGAG TAGGCGGTCA AATGGAGAGA CTGGGTGCkC 

14301 TTCAATCGCT TTCCCGAGAA CCAGATTTTG GCGCGCCCGC CAGCCCCCAC 
AAGTTAGCGA AAGGGCTCTT GGTCTAAAAC CGCGCGGGCG GTCGGGGGTG 

14351 CATCACCACC GTCAGTGAAA ACGTTCCTGC TCTCACAGAT CACGGGACGC 
GTAGTGGTGG CAGTCACTTT TGCAAGGACG AGA6TGTCTA GTGCCCTGCG 

14401 TACCGCTGCG CAACAGCATC GGAGGAGTCC AGCGAGTGAC CATTACTGAC 
ATGGCGACGC GTTGTCGTAG CCTCCTCAGG TCGCTCACTG GTAATGACTG 

14451 GCCAGACGCC GCACCTGCCC CTACGTTTAC AAGGCCCTGG GCATAGTCTC 
CGGTCTGCGG CGTGGACGGG GATGCAAATG TTCCGGGACC CGTATCAGAG 

14501 GCCGCGCGTC CTATCGAGCC GCACTTTTTG AGCAAGCATG TCCATCCTTA 
CGGCGCGCAG GATAGCTCGG CGTGAAAAAC TCGTTCGTAC AGGTAGGAAT 

14551 TATCGCCCAG CAATAACACA GGCTGGGGCC TGCGCTTCCC AAGCAAGATG 
ATAGCGGGTC GTTATTGTGT CCGACCCCGG ACGCGAAGGG TTCGTTCTAC 

14 601 TTTGGCGGGG CCAAGAAGCG CTCCGACCAA CACCCAGTGC GCGTGCGCGG 
A.^CCGCCCC GGTTCTTCGC GAGGCTGGTT GTGGGTCACG CGCACGCGCC 

14651 GCACTACCGC GCGCCCTGGG GCGCGCACAA ACGCGGCCGC ACTGGGCGCA 
CGTGATGGCG CGCGGGACCC CGCGCGTGTT TGCGCCGGCG TGACCCGCGT 

14701 CCACCGTCGA TGACGCCATC GACGCGGTGG TGGAGGAGGC GCGCAACTAC 
GGTGGCAGCT ACTGCGGTAG CTGCGCCACC ACCTCCTCCG CGCGTTGATG 

14751 ACGCCCACGC CGCCACCAGT GTCCACAGTG GACGCGGCCA TTCAGACCGT 
TGCGGGTGCG GCGGTGGTCA CAGGTGTCAC CTGCGCCGGT AAGTCTGGCA 

14801 GGTGCGCGGA GCCCGGCGCT ATGCTAAAAT GAAGAGACGG CGGAGGCGCG 
CCACGCGCCT CGGGCCGCGA TACGATTTTA CTTCTCTGCC GCCTCCGCGC 

14851 TAGCACGTCG CCACCGCCGC CGACCCGGCA CTGCCGCCCA ACGCGCGGCG 
ATCGTGCAGC GGTGGCGGCG GCTGGGCCGT GACGGCGGGT TGCGCGCCGC 

14901 GCGGCCCTGC TTAACCGCGC ACGTCGCACC GGCCGACGGG CGGCCATGCG 
CGCCGGGACG AATTGGCGCG TGCAGCGTGG CCGGCTGCCC GCCGGTACGC 

14951 GGCCGCTC6A AGGCTGGCCG CGGGTATTGT CACTGTGCCC CCCAGGTCCA 
CCGGCGAGCT TCCGACCGGC GCCCATAACA GTGACACGGG GGGTCCAGGT 

15001 GGCGACGAGC GGCCGCCGCA GCAGCCGCGG CCATTAGTGC TATGACTCAG 
CCGCTGCTCG CCGGCGGCGT CGTCGGCGCC GGTAATCACG ATACTGAGTC 

15051 GGTCGCAGGG GCAACGTGTA TTGGGTGCGC GACTCGGTTA GCGGCCTGCG 
CCAGCGTCCC CGTTGCACAT AACCCACGCG CTGAGCCAAT CGCCGGACGC 
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15151 ACTTAGACTC GTACTGTTGT ATGTATCCAG CGGCGGCGGC GCGCAACGAA 
TGAATCTGAG CATGACAACA TACATAGGTC GCCGCCGCCG CGCGTTGCTT 

15201 GCTATGTCCA AGCGCAAAAT CAAAGAAGAG ATGCTCCAGG TCATCGCGCC 
CGATACAGGT TCGCGTTTTA GTTTCTTCTC TACGAGGTCC AGTAGCGCGG 

15251 GGAGATCTAT GGCCCCCCGA AGAAGGAAGA GCAGGATTAC AAGCCCCGAA 
CCTCTAGATA CCGGGGGGCT TCTTCCTTCT CGTCCTAATG TTCGGGGCTT 

15301 AGCTAAAGCG GGTCAAAAAG AAAAAGAAAG ATGATGATGA TGAACTTGAC 
TCGATTTCGC CCAGTTTTTC TTTTTCTTTC TACTACTACT ACTTGAACTG 

15351 GACGAGGTGG AACTGCTGCA CGCTACCGCG CCCAGGCGAC GGGTACAGTG 
CTGCTCCACC TTGACGACGT GCGATGGCGC GGGTCCGCTG CCCATGTCAC 

15401 GAAAGGTCGA CGCGTAAAAC GTGTTTTGCG ACCCGGCACC ACCGTAGTCT 
CTTTCCAGCT GCGCATTTTG CACAAAACGC TGGGCCGTGG TGGCATCAGA 

15451 TTACGCCCGG TGAGCGCTCC ACCCGCACCT ACAAGCGCGT GTATGATGAG 
AATGCGGGCC ACTCGCGAGG TGGGCGTGGA TGTTCGCGCA CATACTACTC 

15501 GTGTACGGCG ACGAGGACCT GCTTGAGCAG GCCAACGAGC GCCTCGGGGA 
CACATGCCGC TGCTCCTGGA C6AACTCGTC CGGTTGCTCG CGGAGCCCCT 

15551 GTTTGCCTAC GGAAAGCGGC ATAAGGACAT GCTGGCGTTG CCGCTGGACG 
CAAACGGAT6 CCTTTCGCCG TATTCCTGTA CGACCGCAAC GGCGACCTGC 

1560: AGGGCAACCC AACACCTAGC CTAAAGCCCG TAACACTGCA GCAGGTGCTG 
TCCCGTTGGG TTGTGGATCG GATTTCGGGC ATTGTGACGT CGTCCACGAC 

15651 CCCGCGCTTG CACCGTCCGA A6AAAAGCGC GGCCTAAAGC GCGAGTCTGG 
GGGCGCGAAC GTGGCAGGCT TCTTTTCGCG CCGGATTTCG CGCTCAGACC 

15701 TGACTTGGCA CCCACCGTGC AGCTGATGGT ACCCAAGCGC CAGCGACTGG 
ACTGAACCGT GGGTGGCACG TCGACTACCA TGGGTTCGCG GTCGCTGACC 

15751 AAGATGTCTT GGAAAAAATG ACCGTGGAAC CTGGGCTGGA GCCCGAGGTC 
TTCTACAGAA CCTTTTTTAC TGGCACCTTG GACCCGACCT CGGGCTCCAG 

15801 CGCGTGCGGC CAATCAAGCA GGTGGCGCCG GGACTGGGCG TGCAGACCGT 
GCGCACGCCG GTTAGTTCGT CCACCGCGGC CCTGACCCGC ACGTCTGGCA 

15851 GGACGTTCAG ATACCCACTA CCAGTAGCAC CAGTATTGCC ACCGCCACAG 
CCTGCAAGTC TATGGGTGAT GGTCATCGTG GTCATAACGG TGGCGGTGTC 

15901 AGGGCATGGA GACACAAACG TCCCCGGTTG CCTCAGCGGT GGCGGATGCC 
TCCCGTACCT CTGTGTTTGC AGGGGCCAAC GGAGTCGCCA CCGCCTACGG 

15951 GCGGTGCAG6 CGGTCGCTGC GGCCGCGTCC AAGACCTCTA CGGAGGTGCA 
CGCCACGTCC GCCAGCGACG CCGGCGCAGG TTCTGGAGAT GCCTCCACGT 

16001 AACGGACCCG TGGATGTTTC GCGTTTCAGC CCCCCGGCGC CCGCGCCGTT 
TTGCCTGGGC ACCTACAAAG CGCAAAGTCG GGGGGCCGCG GGCGCGGCAA 
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CGAGGAAGTA CGGCGCCGCC AGCGCGCTAC TGCCCGAATA TGCCCTACAT 
GCTCCTTCAT GCCGCGGCGG TCGCGCGATG ACGGGCTTAT ACGGGATGTA 

CCTTCCATTG CGCCTACCCC CGGCTATCGT GGCTACACCT ACCGCCCCAG 
GGAAGGTAAC GCGGATGGGG GCCGATAGCA CCGATGTGGA TGGCGGGGTC 

AAGACGAGCA ACTACCCGAC GCCGAACCAC CACTGGAACC CGGCGCCGCC 
TTCTGCTCGT TGATGGGCTG CGGCTTGGTG GTGACCTTGG GCGGCGGCGG 

GTCGCCGTCG CCAGCCCGTG CTGGCCCCGA TTTCCGTGCG CAGGGTGGCT 
CAGCGGCAGC GGTCGGGCAC GACCGGGGCT AAAGGCACGC GTCCCACCGA 

CGCGAAGGAG GCAGGACCCT GGTGCTGCCA ACAGCGCGCT ACCACCCCAG 
GCGCTTCCTC CGTCCTGGGA CCACGACGGT TGTCGCGCGA TGGTGGGGTC 

CATCGTTTAA AAGCCGGTCT TTGTGGTTCT TGCAGATATG GCCCTCACCT 
GTAGCAAATT TTCGGCCAGA AACACCAAGA ACGTCTATAC CGGGAGTGGA 

GCCGCCTCCG TTTCCCGGTG CCGGGATTCC GAGGAAGAAT GCACCGTAGG 
CGGCGGAGGC AAAGGGCCAC GGCCCTAAGG CTCCTTCTTA CGTGGCATCC 

AGGGGCATGG. CCGGCCACGG CCTGACGGGC GGCATGCGTC GTGCGCACCA 
TCCCCGTACC GGCCGGTGCC GGACTGCCCG CCGTACGCAG CACGCGTGGT 

CCGGCGGCGG C6CGCGTCGC ACCGTCGCAT GCGCGGCGGT ATCCTGCCCC 
GGCCGCCGCC GCGCGCAGCG TGGCAGCGTA CGCGCCGCCA TAGGACGGGG 

TCCTTATTCC ACTGATCGCC GCGGCGATTG GCGCCGTGCC CGGAATTGCA 
AGGAATAAGG TGACTAGCGG CGCCGCTAAC CGCGGCACGG GCCTTAACGT 

TCCGTGGCCT TGCAGGCGCA GAGACACTGA TTAAAAACAA GTTGCATGTG 
AGGCACCGGA ACGTCCGCGT CTCTGTGACT AATTTTTGTT CAACGTACAC 

GAAAAATCAA AATAAAAAGT CTGGACTCTC ACGCTCGCTT GGTCCTGTAA 
CTTTTTAGTT TTATTTTTCA GACCTGAGAG TGCGAGCGAA CCAGGACATT 

CTATTTTGTA GAATGGAAGA CATCAACTTT GCGTCTCTGG CCCCGCGACA 
GATAAAACAT CTTACCTTCT GTAGTTGAAA CGCAGAGACC GGGGCGCTGT 

CGGCTCGCGC CCGTTCATGG GAAACTGGCA AGATATCGGC ACCAGCAATA 
GCCGAGC6CG GGCAAGTACC CTTTGACCGT TCTATAGCCG TGGTCGTTAT 

TGAGCGGTGG CGCCTTCAGC TGGGGCTCGC TGTGGAGCGG CATTAAAAAT 
ACTCGCCACC GC6GAAGTCG ACCCCGAGCG ACACCTCGCC GTAATTTTTA 

TTCGGTTCCA CCGTTAAGAA CTATGGCAGC AAGGCCTGGA ACAGCAGCAC 
AAGCCAAGGT GGCAATTCTT GATACCGTCG TTCCGGACCT TGTCGTCGTG 

AGGCCAGATG CTGAGGGATA AGTTGAAAGA GCAAAATTTC CAACAAAAGG 
TCCGGTCTAC GACTCCCTAT TCAACTTTCT CGTTTTAAAG GTT6TTTTCC 

TGGTAGATGG CCTGGCCTCT GGCATTAGCG GGGTGGTGGA CCTGGCCAAC 
ACCATCTACC GGACCGGAGA CCGTAATCGC CCCACCACCT GGACCGGTTG 

CAGGCAGTGC AAAATAAGAT TAACAGTAAG CTTGATCCCC GCCCTCCCGT 
GTCCGTCACG TTTTATTCTA ATTGTCATTC GAACTAGGGG CGGGAGGGCA 
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17051 AAAAGCGTCC GCGCCCCGAC AGGGAAGAAA CTCTGGTGAC GCAAATAGAC 
TTTTCGCAGG CGCGGGGCTG TCCCTTCTTT GAGACCACTG CGTTTATCTG 

171C1 GAGCCTCCCT CGTACGAGGA GGCACTAAAG CAAGGCCTGC CCACCACCCG 
CTCGGAGGGA GCATGCTCCT CCGTGATTTC GTTCCGGACG GGTGGTGGGC 

17151 TCCCATCGCG CCCATGGCTA CCGGAGTGCT GGGCCAGCAC ACACCC6TAA 
AGGGTAGCGC GGGTACCGAT GGCCTCACGA CCCGGTCGTG T6TGGGCATT 

17201 CGCTGGACCT GCCTCCCCCC GCCGACACCC AGCAGAAACC TGTGCTGCCA 
GCGACCTGGA C6GAGG6GGG CGGCTGTGGG TCGTCTTTGG ACACGACGGT 

17251 GGCCCGACCG CCGTTGTTGT AACCCGTCCT HGCCOCGCGT CCCTGCGCCG 
CCGGGCTGGC GGCAACAACA TTGGGCAGGA TCGGCGCGCA GGGACGCGGC 

17301 CGCCGCCAGC GGTCCGCGAT CGTTGCGGCC CGTAGCCAGT GGCAACTGGC 
GCGGCGGTCG CCAGGCGCTA GCAACGCCGG GCATCGGTCA CCGTTGACCG 

17351 AAAGCACACT GAACAGCATC GTGGGTCTGG GGGTGCAATC CCTGAAGCGC 
TTTCGTGTGA CTTGTCGTAG CACCCA6ACC CCCACGTTAG GGACTTCGCG 

17401 CGACGATGCT TCTGATAGCT AACGTGTCGT ATGTGTGTCA TGTATGCGTC 
GCTGCTACGA AGACTATCGA TTGCACAGCA TACACACAGT ACATACGCAG 

17451 CATGTCGCCG CCAGAGGAGC TGCTGAGCCG CCGCGCGCCC GCTTTCCAAG 
GTACAGCGGC GGTCTCCTCG ACGACTCGGC GGCGCGCGGG CGAAAGGTTC 

17501 ATGGCTACCC CTTCGATGAT GCCGCAGTGG TCTTACATGC ACATCTCGGG 
TACCGATGGG GAAGCTACTA CGGCGTCACC AGAATGTACG TGTAGAGCCC 

17551 CCAGGACGCC TCGGAGTACC TGAGCCCCGG GCTGGTGCAG TTTGCCCGCG 
GGTCCTGCGG AGCCTCATG6 ACTCGGGGCC CGACCACGTC AAACGGGCGC 

17601 CCACCGAGAC GTACTTCAGC CTGAATAACA AGTTTAGAAA CCCCACGGTG 
GGTGGCTCTG CATGAAGTCG GACTTATTGT TCAAATCTTT GGGGTGCCAC 

17651 GCGCCTACGC ACGACGTGAC CACAGACCGG TCCCAGCGTT TGACGCTGCG 
CGCGGATGCG TGCTGCACTG GTGTCTGGCC AGGGTCGCAA ACTGCGACGC 

17701 GTTCATCCCT GTGGACCGTG AGGATACTGC GTACTCGTAC AAGGCGCGGT 
CAAGTAGGGA CACCTGGCAC TCCTATGACG CATGAGCATG TTCCGCGCCA 

17751 TCACCCTAGC TGTGGGTGAT AACCGTGTGC TGGACATGGC TTCCACGTAC 
AGTGGGATCG ACACCCACTA TTGGCACACG ACCTGTACCG AAGGTGCATG 

17801 TTTGACATCC GCGGCGTGCT GGACAGGGGC CCTACTTTTA AGCCCTACTC 
AAACTGTAGG CGCCGCACGA CCTGTCCCCG GGATGAAAAT TCGGGATGAG 

17B51 TGGCACTGCC TACAACGCCC TGGCTCCCAA GGGTGCCCCA AATCCTTGCG 
ACCGTGACGG ATGTTGCGGG ACCGAGGGTT CCCACGGGGT TTAGGAACGC 

17901 AATGGGATGA AGCTGCTACT GCTCTTGAAA TAAACCTAGA AGAAGAGGAC 
TTACCCTACT TCGACGATGA CGAGAACTTT ATTTGGATCT TCTTCTCCTG 
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GATGACAACG AAGACGAAGT AGACGAGCAA GCTGAGCAGC AAAAAACTCA 
CTACTGTTGC TTCTGCTTCA TCTGCTCGTT CGACTCGTCG TTTTTTGAGT 

CGTATTTGGG CAGGCGCCTT ATTCTGGTAT AAATATTACA AAGGAGGGTA 
GCATAAACCC GTCCGCGGAA TAAGACCATA TTTATAATGT TTCCTCCCAT 

TTCAAATAGG TGTCGAAGGT CAAACACCTA AATATGCCGA TAAAACATTT 
AAGTTTATCC ACAGCTTCCA GTTTGTGGAT TTATACGGCT ATTTTGTAAA 

CAACCTGAAC CTCAAATAGG AGAATCTCAG TGGTACGAAA CAGAAATTAA 
GTTGGACTTG GAGTTTATCC TCTTAGAGTC ACCATGCTTT GTCTTTAATT 

TCATGCAGCT GGGAGAGTCC TAAAAAAGAC TACCCCAATG AAACCATGTT 
AGTACGTCGA CCCTCTCAGG ATTTTTTCTG ATGGGGTTAC TTTGGTACAA 

ACGGTTCATA TGCAAAACCC ACAAATGAAA ATGGAGGGCA AGGCATTCTT 
TGCCAAGTAT ACGTTTTGGG TGTTTACTTT TACCTCCCGT TCCGTAAGAA 

GTAAAGCAAC AAAATGGAAA GCTAGAAAGT CAAGTGGAAA TGCAATTTTT 
CATTTCGTTG TTTTACCTTT CGATCTTTCA GTTCACCTTT ACGTTAAAAA 

CTCAACTACT GAGGCAGCCG CAGGCAATGG TGATAACTTG ACTCCTAAAG 
GAGTTGATGA CTCCGTCGGC GTCCGTTACC ACTATTGAAC TGAGGATTTC 

TGGTATTGTA CAGTGAAGAT GTAGATATAG AAACCCCAGA CACTCATATT 
ACCATAACAT GTCACTTCTA CATCTATATC TTTGGGGTCT GTGAGTATAA 

TCTTACATGC CCACTATTAA GGAAGGTAAC TCACGAGAAC TAATGGGCCA 
AGAATGTACG GGTGATAATT CCTTCCATTG AGTGCTCTTG ATTACCCGGT 

ACAATCTATG CCCAACAGGC CTAATTACAT TGCTTTTAGG GACAATTTTA 
TGTTAGATAC GGGTTGTCCG GATTAATGTA ACGAAAATCC CTGTTAAAAT 

TTGGTCTAAT GTATTACAAC AGCACGGGTA ATATGGGTGT TCTGGCGGGC 
AACCAGATTA CATAATGTTG TCGTGCCCAT TATACCCACA AGACCGCCCG 

CAAGCATCGC AGTTGAATGC TGTTGTAGAT TTGCAAGACA GAAACACAGA 
GTTCGTAGCG TCAACTTACG ACAACATCTA AACGTTCTGT CTTTGTGTCT 

GCTTTCATAC CAGCTTTTGC TTGATTCCAT TGGTGATAGA ACCAGGTACT 
CGAAAGTATG GTCGAAAACG AACTAAGGTA ACCACTATCT TGGTCCATGA 

TTTCTATGTG GAATCAGGCT GTTGACAGCT ATGATCCAGA TGTTAGAATT 
AAAGATACAC • CTTAGTCCGA CAACTGTCGA TACTAGGTCT ACAATCTTAA 

ATTGAAAATC ATGGAACTGA AGATGAACTT CCAAATTACT GCTTTCCACT 
TAACTTTTAG TACCTTGACT TCTACTTGAA GGTTTAATGA CGAAAGGTGA 

GGGAGGTGTG ATTAATACAG A6ACTCTTAC CAAGGTAAAA CCTAAAACAG 
CCCTCCACAC TAATTATGTC TCTGAGAATG GTTCCATTTT GGATTTTGTC 

GTCAGGAAAA TGGATGGGAA AAAGATGCTA CAGAATTTTC AGATAAAAAT 
CAGTCCTTTT ACCTACCCTT TTTCTACGAT GTCTTAAAAG TCTATTTTTA 

GAAATAAGAG TTGGAAATAA TTTTGCCATG GAAATCAATC TAAATGCCAA 
CTTTATTCTC AACCTTTATT AAAACGGTAC CTTTAGTTAG ATTTACGGTT 
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18951 AGCTAAAGTA CAGTCCTTCC AACGTAAAAA TTTCTGATAA CCCAAACACC 
TCGATTTCAT GTCAGGAAGG TTGCATTTTT AAAGACTATT GGGTTTGTGG 

19001 TACGACTACA TGAACAAGCG AGTGGTGGCT CCCGGGCTAG TGGACTGCTA 
ATGCTGATGT ACTTGTTCGC TCACCACCGA GGGCCCGATC ACCTGACGAT 

19051 CATTAACCTT GGAGCACGCT GGTCCCTTGA CTATATGGAC AACGTCAACC 
GTAATTGGAA CCTCGTGCGA CCAGGGAACT GATATACCTG TTGCAGTTGG 

19101 CATTTAACCA CCACCGCAAT GCTGGCCTGC GCTACCGCTC AATGTTGCTG 
GTAAATTGGT GGTGGCGTTA CGACCGGACG CGATGGCGAG TTACAACGAC 

19151 GGCAATGGTC 6CTATGTGCC CTTCCACATC CAGGTGCCTC AGAAGTTCTT 
CCGTTACCAG CGATACACGG GAAGGTGTAG GTCCACGGAG TCTTCAAGAA 

19201 TGCCATTAAA AACCTCCTTC TCCTGCCGGG CTCATACACC TACGAGTGGA 
ACGGTAATTT TTGGAGGAAG. AGGACGGCCC GAGTATGTGG ATGCTCACCT 

19251 ACTTCAGGAA GGATGTTAAC ATGGTTCTGC AGAGCTCCCT AGGAAATGAC 
TGAAGTCCTT CCTACAATTG TACCAAGACG TCTCGAGGGA TCCTTTACTG 

19301 CTAAGGGTTG ACGGAGCCAG CATTAAGTTT GATAGCATTT GCCTTTACGC 
GATTCCCAAC TGCCTCGGTC GTAATTCAAA CTATCGTAAA CGGAAATGCG 

19351 CACCTTCTTC CCCATGGCCC ACAACACCGC CTCCACGCTT GAGGCCATGC 
GTGGAAGAAG GGGTACCGGG TGTTGTGGCG GAGGTGCGAA CTCCGGTACG 

19401 TTAGAAACGA CACCAACGAC CAGTCCTTTA ACGACTATCT CTCCGCCGCC 
AATCTTTGCT GTGGTTGCTG GTCAGGAAAT TGCTGATAGA GAGGCGGCGG 

19451 AACATGCTCT ACCCTATACC CGCCAACGCT ACCAACGTGC CCATATCCAT 
TTGTACGAGA TGGGATATGG GCGGTTGCGA TGGTTGCACG GGTATAGGTA 

19501 CCCCTCCCGC AACTGGGCGG CTTTCCGCGG CTGGGCCTTC ACGCGCCTTA 
GGGGAGGGCG TTGACCCGCC GAAAGGCGCC GACCCGGAAG TGCGCGGAAT 

19551 AGACTAAGGA AACCCCATCA CTGGGCTCGG GCTACGACCC TTATTACACC 
TCTGATTCCT TTGGGGTAGT GACCCGAGCC CGATGCTGGG AATAATGTGG 

19601 TACTCTGGCT CTATACCCTA CCTAGATGGA ACCTTTTACC TCAACCACAC 
ATGAGACCGA GATATGGGAT GGATCTACCT TGGAAAATGG AGTTGGTGTG 

19651 CTTTAAGAAG GTGGCCATTA CCTTTGACTC TTCTGTCAGC TGGCCTGGCA 
GAAATTCTTC CACCGGTAAT GGAAACTGAG AAGACAGTCG ACCGGACCGT 

19701 ATGACCGCCT GCTTACCCCC AACGAGTTTG AAATTAAGCG CTCAGTTGAC 
TACTGGCGGA CGAATGGGGG TTGCTCAAAC TTTAATTCGC GAGTCAACTG 

19751 GGGGAGGGTT ACAACGTTGC CCAGTGTAAC ATGACCAAAG ACTGGTTCCT 
CCCCTCCCAA TGTTGCAACG GGTCACATTG TACTGGTTTC TGACCAAGGA 

19801 GGTACAAATG CTAGCTAACT ATAACATTGG CTACCAGGGC TTCTATATCC 
CCATGTTTAC GATCGATTGA TATTGTAACC GATGGTCCCG AAGATATAGG 
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CAGAGAGCTA CAAGGACCGC ATGTACTCCT TCTTTAGAAA CTTCCAGCCC 
GTCTCTCGAT GTTCCTGGCG TACATGAGGA AGAAATCTTT GAAGGTCGGG 

ATGAGCCGTC AGGTGGTGGA TGATACTAAA TACAAGGACT ACCAACAGGT 
TACTCGGCAG TCCACCACCT ACTATGATTT ATGTTCCTGA TGGTTGTCCA 

GGGCATCCTA CACCAACACA ACAACTCTGG ATTTGTTGGC TACCTTGCCC 
CCCGTAGGAT GTGGTTGTGT TGTTGAGACC TAAACAACCG ATGGAACGGG 

CCACCATGCG CGAAGGACAG GCCTACCCTG CTAACTTCCC CTATCCGCTT 
GGTGGTACGC GCTTCCTGTC CGGATGGGAC GATTGAAGGG GATAGGCGAA 

ATAGGCAAGA CCGCAGTTGA CA6CATTACC CAGAAAAAGT TTCTTTGCGA 
TATCCGTTCT GGCGTCAACT GTCGTAATGG GTCTTTTTCA AAGAAACGCT 

TCGCACCCTT TGGCGCATCC CATTCTCCAG TAACTTTATG TCCATGGGCG 
AGCGTGGGAA ACCGCGTAGG GTAAGAGGTC ATTGAAATAC AGGTACCCGC 

CACTCACAGA CCTGGGCCAA AACCTTCTCT ACGCCAACTC CGCCCACGCG 
GTGAGTGTCT GGACCCGGTT TTGGAAGAGA TGCGGTTGAG GCGGGTGCGC 

CTAGACATGA CTTTTGAGGT GGATCCCATG GACGAGCCCA CCCTTCTTTA 
GATCTGTACT GAAAACTCCA CCTAGGGTAC CTGCTCGGGT GGGAAGAAAT 

TGTTTTGTTT GAAGTCTTTG ACGTGGTCCG TGTGCACCAG CCGCACCGCG' 
ACAAAACAAA CTTCAGAAAC TGCACCAGGC ACACGTGGTC GGCGTGGCGC 

GCGTCATCGA AACCGTGTAC CTGCGCACGC CCTTCTCGGC CGGCAACGCC 
CGCAGTAGCT TTGGCACATG GACGCGTGCG GGAAGAGCCG GCCGTTGCGG 

ACAACATAAA GAAGCAAGCA ACATCAACAA CAGCTGCCGC CATGGGCTCC 
TGTTGTATTT CTTCGTTCGT TGTAGTTGTT GTCGACGGCG GTACCCGAGG 

AGTGAGCAGG AACTGAAAGC CATTGTCAAA GATCTTGGTT GTGGGCCATA 
TCACTCGTCC TTGACTTTCG GTAACAGTTT CTAGAACCAA CACCCGGTAT 

TTTTTTGGGC ACCTATGACA AGCGCTTTCC AGGCTTTGTT TCTCCACACA 
AAAAAACCCG TGGATACTGT TCGCGAAAGG TCCGAAACAA AGAGGTGTGT 

AGCTCGCCTG CGCCATAGTC AATAC6GCCG GTCGCGAGAC TGGGGGCGTA 
TCGAGCGGAC QCGGTATCAG TTATGCCGGC CAGCGCTCTG ACCCCCGCAT 

CACTGGATGG CCTTTGCCTG GAACCCGCAC TCAAAAACAT GCTACCTCTT 
GTGACCTACC GGAAACGGAC CTTGGGCGTG AGTTTTTGTA CGATGGAGAA 

TGAGCCCTTT GGCTTTTCTG ACCAGCGACT CAAGCAGGTT TACCAGTTTG 
ACTCGGGAAA CCGAAAAGAC TGGTCGCTGA GTTCGTCCAA ATGGTCAAAC 

AGTACGAGTC ACTCCTGCGC CGTAGCGCCA TTGCTTCTTC CCCCGACCGC 
TCATGCTCAG TGAGGACGCG GCATC6CGGT AACGAAGAAG GGG6CT6GCG 

7GTATAACGC TGGAAAAGTC CACCCAAAGC 6TACAGGGGC CCAACTCGGC 
ACATATTGCG ACCTTTTCAG GTGGGTTTCG CATGTCCCCG GGTTGAGCCG 

CGCCTGTGGA CTATTCTGCT GCATGTTTCT CCACGCCTTT GCCAACTGGC 
GCGGACACCT GATAAGACGA CGTACAAAGA GGTGCGGAAA CGGTTGACCG 
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20851 CCCAACTCCA TGCTCAACAG TCCCCAGGTA CAGCCCACCC TGCGTCGCAA 
GGGTTGAGGT ACGAGTTGTC AGGGGTCCAT GTCGGGTGGG ACGCAGCGTT 

20901 CCAGGAACAG CTCTACAGCT TCCTGGAGCG CCACTCGCCC TACTTCCGCA 
GGTCCTTGTC GAGATGTCGA AGGACCTCGC GGTGAGCGGG ATGAAGGCGT 

20951 GCCACAGTGC GCAGATTAGG AGCGCCACTT CTTTTTCTCA CTTGAAAAAC 
CGGTGTCACG CGTCTAATCC TCGCGGTGAA GAAAAACAGT GAACTTTTTG 

21001 ATGTAAAAAT AATGTACTAG AGACACTTTC AATAAAGGCA AATGCTTTTA 
TACATTTTTA TTACATGATC TCTGTGAAAG TTATTTCCGT TTACGAAAAT 

21051 TTTGTACACT CTCGGGTGAT TATTTACCCC CACCCTTGCC GTCTGCGCCG 
AAACATGTGA GAGCCCACTA ATAAATGGGG GTGGGAACGG CAGACGCGGC 

21101 TTTAAAAATC AAAGGGGTTC TGCCGCGCAT CGCTATGCGC CACTGGCAGG 
AAATTTTTAG TTTCCCCAAG ACGGCGCGTA GCGATACGCG GTGACCGTCC 

21151 GACACGTTGC GATACTGGTG TTTAGTGCTC CACTTAAACT CAGGCACAAC 
CTGTGCAACG CTATGACCAC AAATCACGAG GTGAATTTGA GTCCGTGTTG 

21201 CATCCGCGGC AGCTCGGTGA AGTTTTCACT CCACAGGCTG CGCACCATCA 
GTAGGCGCCG TCGAGCCACT TCAAAAGTGA GGTGTCCGAC GCGTGGTAGT 

21251 CCAACGCGTT TAGCAGGTCG GGCGCCGATA TCTTGAAGTC GCAGTTGGGG 
GGTTGCGCAA ATCGTCCAGC CCGCGGCTAT AGAACTTCAG CGTCAACCCC 

21301 CCTCCGCCCT GCGCGCGCGA GTTGCGATAC ACAGGGTTGC AGCACTGGAA 
GGAGGCGGGA CGCGCGCGCT CAACGCTATG TGTCCCAACG TCGTGACCTT 

21351 CACTATCAGC GCCGGGTGGT GCACGCTGGC CAGCACGCTC TTGTCGGAGA 
GTGATAGTCG CGGCCCACCA CGTGCGACCG GTCGTGCGAG AACAGCCTCT 

21401 TCAGATCCGC GTCCAGGTCC TCCGC6TTGC TCAGGGCGAA CGGAGTCAAC 
AGTCTAGGCG CAGGTCCAGG AGGCGCAACG AGTCCCGCTT GCCTCAGTTG 

21451 TTTGGTAGCT GCCTTCCCAA AAAGGGCGCG TGCCCAGGCT TTGAGTTGCA 
AAACCATCGA CGGAAGGGTT TTTCCCGCGC ACG6GTCCGA AACTCAACGT 

21501 CTCGCACCGT AGTGGCATCA AAAGGTGACC GTGCCCGGTC TGGGCGTTAG 
GAGCGTGGCA TCACCGTAGT TTTCCACTGG CACGGGCCAG ACCCGCAATC 

21551 GATACAGCGC CTGCATAAAA GCCTTGATCT GCTTAAAAGC CACCTGAGCC 
CTATGTCGCG GACGTATTTT CGGAACTAGA CGAATTTTCG GTGGACTCGG 

21601 TTTGCGCCTT CAGAGAAGAA CATGCCGCAA GACTTGCCGG AAAACTGATT 
AAACGCGGAA GTCTCTTCTT GTACGGCGTT CTGAACGGCC TTTTGACTAA 

21651 GGCCGGACAG GCCGCGTCGT GCACGCAGCA CCTTGCGTCG GTGTTGGAGA 
CCGGCCTGTC CGGCGCAGCA CGTGC6TCGT GGAACGCAGC CACAACCTCT 

21701 TCTGCACCAC ATTTCGGCCC CACCGGTTCT TCACGATCTT GGCCTTGCTA 
AGACGTGGTG TAAAGCCGGG GTGGCCAAGA AGTGCTAGAA CCGGAACGAT 
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AATCACGTGC TCCTTATTTA TCATAATGCT TCCGTGTAGA CACTTAAGCT 
TTAGTGCACG AGGAATAAAT AGTATTACGA AGGCACATCT GTGAATTCGA 

CGCCTTCGAT CTCAGCGCA6 CGGTGCAGCC ACAACGCGCA GCCCGTGGGC 
GCGGAAGCTA GAGTCGCGTC GCCACGTCGG TGTTGCGCGT CGGGCACCCG 

TCGTGATGCT TGTAGGTCAC CTCTGCAAAC GACTGCAGGT AC6CCTGCAG 
AGCACTACGA ACATCCA6TG GAGACGTTTG CTGACGTCCA TGCGGACGTC 

GAATCGCCCC ATCATCGTCA CAAAGGTCTT GTTGCTGGTG AAGGTCAGCT 
CTTAGCGGGG TAGTAGCAGT GTTTCCAGAA CAACGACCAC TTCCAGTCGA 

GCAACCCGCG 6TGCTCCTCG TTCAGCCAGG TCTTGCATAC GGCCGCCAGA 
CGTTGGGCGC CACGAGGA6C AAGTCGGTCC AGAACGTATG CCGGCGGTCT 

GCTTCCACTT GGTCAGGCAG TAGTTTGAAG TTCGCCTTTA GATCGTTATC 
CGAAGGTGAA CCAGTCCGTC ATCAAACTTC AAGCGGAAAT CTAGCAATAG 

CACGTGGTAC TTGTCCATCA GCGCGCGCGC AGCCTCCATG CCCTTCTCCC 
GTGCACCATG AACAGGTAGT CGCGCGCGCG TCGGAGGTAC GGGAAGAGGG 

ACGCAGACAC GATCGGCACA CTCAGCGGGT TCATCACCGT AATTTCACTT 
TGCGTCTGTG CTAGCCGTGT GAGTCGCCCA AGTAGTGGCA TTAAAGTGAA 

TCCGCTTCGC TGGGCTCTTC CTCTTCCTCT TGCGTCCGCA TACCACGCGC 
AGGCGAAGCG ACCCGAGAAG GAGAAGGAGA ACGCAGGCGT ATGGTGCGCG 

CACTGGGTCG TCTTCATTCA GCCGCCGCAC TGTGCGCTTA CCTCCTTTGC 
GTGACCCAGC AGAAGTAAGT CGGCGGCGTG ACACGCGAAT GGAGGAAACG 

CATGCTTGAT TAGCACCGGT GGGTTGCTGA AACCCACCAT TTGTAGCGCC 
GTACGAACTA ATCGTGGCCA CCCAACGACT TTGGGTGGTA AACATCGCGG 

ACATCTTCTC TTTCTTCCTC GCTGTCCACG ATTACCTCTG GTGATGGCGG 
TGTAGAAGAG AAAGAAGGAG CGACAGGTGC TAATGGAGAC CACTACCGCC 

GCGCTCGGGC TTGGGAGAAG GGCGCTTCTT TTTCTTCTTG GGCGCAATGG 
CGCGAGCCCG AACCCTCTTC CCGCGAAGAA AAAGAAGAAC CCGCGTTACC 

CCAAATCCGC CGCCGAGGTC GATGGCCGCG GGCTGGGTGT GCGCGGCACC 
GGTTTAGGCG GCGGCTCCAG CTACCGGCGC CCGACCCACA CGCGCCGTGG 

AGCGCGTCTT GTGATGAGTC TTCCTCGTCC TCGGACTCGA TACGCCGCCT 
TCGCGCAGAA CACTACTCAG AAGGAGCAGG AGCCTGAGCT ATGCGGCGGA 

CATCCGCTTT TTTGGGGGCG CCC6GGGAGG CGGCGGCGAC GGGGACGGGG 
GTAGGCGAAA AAACCCCCGC GGGCCCCTCC GCCGCCGCTG CCCCTGCCCC 

ACGACACGTC CTCCATGGTT GGGGGACGTC GCGCGGCACC GCGTCCGCGC 
TGCTGTGCAG GAGGTACCAA CCCCCTGCAG CGCGGCGTGG CGCAGGCGCG 

TCGGGGGTGG TTTCGCGCTG CTCCTCTTCC CGACTGGCCA TTTCCTTCTC 
AGCCCCCACC AAAGCGCGAC GAGGAGAAGG GCTGACCGGT AAAGGAAGAG 
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CCGCCCCCTC TGAGTTCGCC ACCACCGCCT CCACCGATGC CGCCAACGCG 
6GCGGGGGAG ACTCAAGCGG TGGTGGCGGA GGTGGCTACG GCGGTTGCGC 

CCTACCACCT TCCCCGTCGA GGCACCCCCG CTTGAGGAGG AGGAAGTGAT 
GGATGGTGGA AGGGGCAGCT CCGTGGGGGC GAACTCCTCC TCCTTCACTA 

TATCGAGCAG GACCCA6GTT TTGTAAGCGA AGACGACGAG GACCGCTCAG 
ATAGCTC6TC CTGGGTCCAA AACATTCGCT TCTGCTGCTC CTGGCGAGTC 

TACCAACAGA GGATAAAAAG CAAGACCAGG ACAACGCAGA GGCAAACGAG 
ATGGTTGTCT CCTATTTTTC GTTCTGGTCC TGTTGCGTCT CCGTTTGCTC 

GAACAAGTCG GGCGGGGGGA CGAAAGGCAT GGCGACTACC TAGATGTGGG 
CTTGTTCAGC CCGCCCCCCT GCTTTCCGTA CCGCTGATGG ATCTACACCC 

AGACGACGTG CTGTTGAAGC ATCTGCAGCG CCAGTGCGCC ATTATCTGCG 
TCTGCTGCAC GACAACTTCG TAGACGTCGC GGTCACGCGG TAATAGACGC 

ACGCGTTGCA AGAGCGCAGC GATGTGCCCC TCGCCATAGC GGATGTCAGC 
TGCGCAACGT TCTCGCGTCG CTACACGGGG AGCGGTATCG CCTACAGTCG 

CTTGCCTACG AACGCCACCT ATTCTCACCG CGCGTACCCC CCAAACGCCA 
GAACGGATGC TTGCGGTGGA TAAGAGTGGG GCGCATGGGG GGTTTGCGGT 

AGAA^ACGGC ACATGCGAGC CCAACCCGCG CCTCAACTTC TACCCCGTAT 
TCTTTTGCCG TGTACGCTCG GGTTGGGCGC GGAGTTGAAG ATGGGGCATA 

TTGCCGTGCC AGAGGTGCTT GCCACCTATC ACATCTTTTT CCAAAACTGC 
AACGGCACGG TCTCCACGAA CGGTGGATAG TGTAGAAAAA GGTTTTGACG 

AAGATACCCC TATCCTGCCG TGCCAACCGC AGCCGAGCGG ACAAGCAGCT 
TTCTATGGGG ATAGGACGGC ACGGTTGGCG TCGGCTCGCC TGTTCGTCGA 

GGCCTTGCGG CAGGGCGCTG TCATACCTGA TATCGCCTCG CTCAACGAAG 
CCGGAACGCC GTCCCGCGAC AGTATGGACT ATAGCGGAGC GAGTTGCTTC 

TGCCAAAAAT CTTTGAGGGT CTTGGACQCG ACGAGAAGCG CGCGGCAAAC 
ACGGTTTTTA GAAACTCCCA GAACCTGCGC TGCTCTTCGC GCGCCGTTTG 

GCTCTGCAAC AGGAAAACAG CGAAAATGAA AGTCACTCTG GAGTGTTGGT 
CGAGACGTTG TCCTTTTGTC GCTTTTACTT TCAGTGAGAC CTCACAACCA 

GGAACTCGAG GGTGACAACG CGCGCCTAGC CGTACTAAAA CGCAGCATCG 
CCTTGAGCTC CCACTGTTGC GCGCGGATCG GCATGATTTT GCGTCGTAGC 

AGGTCACCCA CTTTGCCTAC CCGGCACTTA ACCTACCCCC CAAGGTCATG 
TCCAGTGGGT GAAACGGATG GGCCGTGAAT TGGATGGGGG GTTCCA6TAC 

AGCACAGTCA TGA6TGAGCT GATCGTGCGC CGTGCGCAGC CCCTGGAGAG 
TCGTGTCAGT ACTCACTCGA CTAGCACGCG GCACGCGTCG GGGACCTCTC 

GGATGCAAAT TTGCAAGAAC AAACAGAGGA GGGCCTACCC GCAGTTGGCG 
CCTACGTTTA AACGTTCTTG TTTGTCTCCT CCCGGATGGG CGTCAACCGC 
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23701 GAGCGACGCA AACTAATGAT GGCCGCAGTG CTCGTTACCG TGGAGCTTGA 
CTCGCTGCGT TTGATTACTA CCGGCGTCAC GAGCAATGGC ACCTCGAACT 

23751 GTGCATGCAG CGGTTCTTTG CTGACCCGGA GATGCAGCGC AAGCTAGAGG 
CAC6TACGTC GCCAAGAAAC GACTGGGCCT CTACGTCGCG TTCGATCTCC 

23801 AAACATTGCA CTACACCTTT CGACAGGGCT ACGTACGCCA GGCCTGCAAG 
TTTGTAACGT GATGTGGAAA GCTGTCCCGA TGCATGCGGT CCGGACGTTC 

23851 ATCTCCAACG TGGAGCTCTG CAACCTGGTC TCCTACCTTG GAATTTTGCA 
TAGAGGTTGC ACCTCGAGAC GTTGGACCAG AGGATGGAAC CTTAAAACGT 

23901 CGAAAACCGC CTTGGGCAAA ACGTGCTTCA TTCCACGCTC AAGGGCGAGG 
GCTTTTGGCG GAACCCGTTT TGCACGAAGT AAGGTGCGAG TTCCCGCTCC 

23951 CGCGCCGCGA CTACGTCCGC GACTGCGTTT ACTTATTtCT ATGCTACACC 
GCGCGGCGCT GATGCAGGCG CTGACGCAAA TGAATAAAGA TACGATGTGG 

24001 TGGCAGACGG CCATGGGCGT TTGGCAGCAG TGCTTGGAGG AGTGCAACCT 
ACCGTCTGCC GGTACCCGCA AACCGTCGTC ACGAACCTCC TCACGTTGGA 

24051 CAAGGAGCTG CAGAAACTGC TAAAGCAAAA CTTGAAGGAC CTATGGACGG 
GTTCCTCGAC GTCTTTGACG ATTTCGTTTT GAACTTCCTG GATACCTGCC 

24101 CCTTCAACGA GCGCTCCGTG GCCGCGCACC TGGC.GGACAT CATTTTCCCC 
GGAAGTTGCT CGCGAGGCAC CGGCGCGTGG ACCGCCTGTA GTAAAAGGGG 

24151 GAACGCCTGC TTAAAACCCT GCAACAGGGT CTGCCAGACT TCACCAGTCA 
CTTGCGGACG AATTTTGGGA CGTTGTCCCA GACGGTCTGA AGTGGTCAGT 

24201 AAGCATGTTG CAGAACTTTA GGAACTTTAT CCTAGAGCGC TCAGGAATCT 
TTCGTACAAC GTCTTGAAAT CCTTGAAATA GGATCTCGCG AGTCCTTAGA 

24251 TGCCCGCCAC CTGCTGTGCA CTTCCTAGCG ACTTTGTGCC CATTAAGTAC 
ACGGGCGGTG GACGACACGT GAAGGATCGC TGAAACACGG GTAATTCATG 

24301 CGCGAATGCC CTCCGCCGCT TTGGGGCCAC TGCTACCTTC TGCAGCTAGC 
GCGCTTACGG GAGGCGGCGA AACCCCGGTG ACGATGGAAG ACGTCGATCG 

24351 CAACTACCTT GCCTACCACT CTGACATAAT GGAAGACGTG AGCGGTGACG 
GTTGATGGAA CGGATGGTGA GACTGTATTA CCTTCTGCAC TCGCCACTGC 

24401 GTCTACTGGA GTGTCACTGT C6CT<3CAACC TATGCACCCC GCACCGCTCC 
CAGATGACCT CACA6TGACA GCGACGTTGG ATACGTGGGG CGTGGCGAGG 

24451 CTGGTTTGCA ATTCGCAGCT GCTTAACGAA AGTCAAATTA TCGGTACCTT 
GACCAAACGT TAAGCGTCGA CGAATTGCTT TCAGTTTAAT AGCCATGGAA 

24501 TGAGCTGCAG CGTCCCTCGC CTGAC6AAAA GTCCGCGGCT CCGGGGTTGA 
ACTCGACGTC CCAGGGAGCG GACTGCTTTT CAGGCGCCGA GGCCCCAACT 

24551 AACTCACTCC GGGGCTGTGG ACGTCGGCTT ACCTTCGCAA ATTTGTACCT 
TTGAGTGAGG CCCCGACACC TGCAGCCGAA TGGAAGCGTT TAAACATGGA 
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24601 GAGGACTACC ACGCCCACGA GATTAGGTTC TACGAAGACC AATCCCGCCC 
CTCCTGATGG TGCGGGTGCT CTAATCCAAG ATGCTTCTGG TTAGGGCGGG 

24651 GCCTAATGCG GAGCTTACCG CCTGCGTCAT TACCCAGGGC CACATTCTTG 
CGGATTACGC CTCGAATGGC GGACGCAGTA ATGGGTCCCG GTGTAAGAAC 

24701 GCCAATTGCA AGCCATCAAC AAAGCCCGCC AAGAGTTTCT GCTACGAAAG 
CGGTTAACGT TCGGTAGTTG TTTCGGGCGG TTCTCAAAGA CGATGCTTTC 

24751 GGACGGGGGG TTTACTTGGA CCCCCAGTCC GGCGAGGAGC TCAACCCAAT 
CCTGCCCCCC AAATGAACCT GGGGGTCAGG CCGCTCCTCG AGTTGGGTTA 

24801 CCCCCCGCCG CCGCAGCCCT ATCAGCAGCA GCCGCGGGCC CTTGCTTCCC 
GGGGGGCGGC GGCGTCGGGA TAGTCGTCGT CGGCGCCCGG GAACGAAGGG 

24851 AGGATGGCAC CCAAAAA6AA GCTGCAGCTG CCGCCGCCAC CCACGGACGA 
TCCTACCGTG GGTTTTTCTT C6ACGTCGAC GGCGGCGGTG GGTGCCTGCT 

24901 GGAGGAATAC TGGGACAGTC AGGCAGAGGA GGTTTTGGAC GAGGAGGAGG 
CCTCCTTATG ACCCTGTCAG TCCGTCTCCT CCAAAACCTG CTCCTCCTCC 

24951 AGGACATGAT GGAAGACTGG GAGAGCCTAG ACGAGGAAGC TTCCGAGGTC 
TCCTGTACTA CCTTCTGACC CTCTCGGATC TGCTCCTTCG AAGGCTCCAG 

25001 GAAGAGGTGT CAGACGAAAC ACCGTCACCC TCGGTCGCAT TCCCCTCGCC 
CTTCTCCACA GTCTGCTTTG TGGCAGTGGG AGCCAGCGTA AGGGGAGCGG 

25051 GGCGCCCCAG AAATCGGCAA CCGGTTCCAG CATGGCTACA ACCTCCGCTC 
CCGCGGGGTC TTTAGCCGTT GGCCAAGGTC GTACCGATGT TGGAGGCGAG 

25101 CTCAGGCGCC GCCGGCACTG CCCGTTCGCC GACCCAACCG TAGATGGGAC 
GAGTCCGCGG CGGCCGTGAC GGGCAAGCGG CTGGGTTGGC ATCTACCCTG 

25151 ACCACTGGAA CCAGGGCCGG TAAGTCCAAG CAGCCGCCGC CGTTAGCCCA 
TGGTGACCTT GGTCCCGGCC ATTCAGGTTC GTCGGCGGCG GCAATCGGGT 

25201 AGAGCAACAA CAGCGCCAAG GCTACCGCTC ATGGCGCGGG CACAAGAACG 
TCTCGTTGTT GTCGCGGTTC CGATGGCGAG TACCGCGCCC GTGTTCTTGC 

25251 CCATAGTTGC TTGCTTGCAA GACTGTGGGG GCAACATCTC CTTCGCCCGC 
GGTATCAACG AACGAACGTT CTGACACCCC CGTTGTAGAG GAAGCGGGCG 

25301 CGCTTTCTTC TCTACCATCA CGGCGTGGCC TTCCCCCGTA ACATCCTGCA 
GCGAAAGAAG AGATGGTAGT GCCGCACCGG AAGGGGGCAT TGTAGGACGT 

25351 TTACTACCGT CATCTCTACA GCCCATACTG CACCGGCGGC AGCGGCAGCA 
AATGATGGCA GTAGAGATGT CGGGTATGAC GTGGCCGCCG TCGCCGTCGT 

25401 ACAGCAGCGG CCACACAGAA GCAAAGGCGA CCGGATAGCA AGACTCTGAC 
TGTCGTCGCC GGTGTGTCTT C6TTTCCGCT GGCCTATCGT TCTGAGACTG 

25451 AAAGCCCAAG AAATCCACAG CGGCGGCAGC AGCAGGAGGA GGAGCGCTGC 
TTTCGGGTTC TTTAGGTGTC GCCGCCGTCG TCGTCCTCCT CCTCGCGACG 

25501 GTCTGGCGCC CAACGAACCC GTATCGACCC GCGAGCTTAG AAACAGGATT 
CAGACCGCGG GTTGCTTGGG CATAGCTGGG CGCTCGAATC TTTGTCCTAA 
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TTTCCCACTC TGTATGCTAT ATTTCAACAG AGCAGGGGCC AAGAACAAGA 
AAAGGGTGAG ACATACGATA TAAAGTTGTC TCGTCCCCGG TTCTTGTTCT 

GCTGAAAATA AAAAACAGGT CTCTGCGATC CCTCACCCGC AGCTGCCTGT 
CGACTTTTAT TTTTTGTCCA GAGACGCTAG GGAGTGGGCG TCGACGGACA 

ATCACAAAAG CGAAGATCAG CTTCGGCGCA CGCTGGAAGA CGCGGAGGCT 
TAGTGTTTTC GCTTCTAGTC GAAGCCGCGT GCGACCTTCT GCGCCTCCGA 

CTCTTCAGTA AATACTGCGC GCTGACTCTT AAGGACTAGT TTCGCGCCCT 
GAGAAGTCAT TTATGACGCG CGACTGAGAA TTCCTGATCA AAGCGCGGGA 

TTCTCAAATT TAAGCGCGAA AACTACGTCA TCTCCAGCGG CCACACCCGG 
AAGAGTTTAA ATTCGCGCTT TTGATGCAGT AGAG6TCGCC GGTGTGGGCC 

CGCCAGCACC TGTTGTCAGC GCCATTATGA GCAAGGAAAT TCCCACGCCC 
GCGGTCGTGG ACAACAGTCG CGGTAATACT CGTTCCTTTA AGGGTGCGGG 

TACATGTGGA GTTACCAGCC ACAAATGGGA CTTGCGGCTG GAGCTGCCCA 
ATGTACACCT CAATGGTCGG TGTTTACCCT GAACGCCGAC CTCGACGGGT 

AGACTACTCA ACCCGAATAA ACTACATGAG CGCGGGACCC CACATGATAT 
TCTGATGAGT TGGGCTTATT TGATGTACTC GCGCCCTGGG GTGTACTATA 

CCCGGGTCAA CGGAATACGC GCCCACCGAA ACCGAATTCT CCTGGAACAG 
GGGCCCAGTT GCCTTATGCG CGGGTGGCTT TGGCTTAAGA GGACCTTGTC 

GCGGCTATTA CCACCACACC TCGTAATAAC CTTAATCCCC GTAGTTGGCC 
CGCCGATAAT GGTGGTGTGG AGCATTATTG GAATTAGGGG CATCAACCGG 

CGCTGCCCTG GTGTACCAGG AAAGTCCCGC TCCCACCACT GTGGTACTTC 
GCGACGGGAC CACATGGTCC TTTCAGGGCG AGGGTGGTGA CACCATGAAG 

CCAGAGACGC CCAGGCCGAA GTTCAGATGA CTAACTCAGG GGCGCAGCTT 
GGTCTCTGCG GGTCCGGCTT CAAGTCTACT GATTGAGTCC CCGCGTCGAA 

GCGGGCGGCT TTCGTCACAG GGTGCGGTCG CCCGGGCAGG GTATAACTCA 
CGCCCGCCGA AAGCAGTGTC CCACGCCAGC GGGCCCGTCC CATATTGAGT 

CCTGACAATC AGAGGGCGAG GTATTCAGCT CAACGACGAG TCGGTGAGCT 
GGACTGTTAG TCTCCCGCTC CATAAGTCGA GTTGCTGCTC AGCCACTCGA 

CCTCGCTTGG TCTCCGTCCG GACGGGACAT TTCAGATCGG CGGCGCCGGC 
GGAGCGAACC AGAGGCAGGC CTGCCCTGTA AAGTCTAGCC GCCGCGGCCG 

CGCTCTTCAT TCACGCCTCG TGAGGCAATC CTAACTCTGC AGACCTCGTC 
GCGAGAAGTA AGTGCGGAGC AGTCCGTTAG GATTGAGACG TCTGGAGCAG 

CTCTGAGCCG CGCTCTGGAG GCATTGGAAC TCTGCAATTT ATTGAGGAGT 
GAGACTCGGC GCGAGACCTC CGTAACCTTG AGACGTTAAA TAACTCCTCA 

TTGTGCCATC GGTCTACTTT AACCCCTTCT CGGGACCTCC CGGCCACTAT 
AACACGGTAG CCAGATGAAA TT6GGGAAGA GCCCTGGAGG GCCGGTGATA 

CCGGATCAAT TTATTCCTAA CTTTGACGCG GTAAAGGACT CGGCGGACGG 
GGCCTAGTTA AATAAGGATT 6AAACTGCGC CATTTCCTGA GCCGCCTGCC 
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26501 CTACGACTGA ATGTTAAGTG GAGAGGCAGA GCAACTGCGC CTGAAACACC 
GATGCTGACT TACAATTCAC CTCTCCGTCT C6TTGACGCG GACTTTGTGG 

26551 TGGTCCACTG TCGCCGCCAC AAGTGCTTTG CCCGCGACTC CGGTGAGTTT 
ACCAGGTGAC AGCGGCGGTG TTCACGAAAC GGGCGCTGAG GCCACTCAAA 

26601 TGCTACTTTG AATTGCCCGA GGATCATATC GAGGGCCCGG CGCACGGCGT 
AC6ATGAAAC TTAACGGGCT CCTAGTATAG CTCCCGGGCC GCGTGCCGCA 

26651 CCGGCTTACC GCCCAGGGAG AGCTTGCCCG TAGCCTGATT CGGGAGTTTA 
GGCCGAATGG CGGGTCCCTC TCGAACGGGC ATCGGACTAA GCCCTCAAAT 

26701 CCCAGCGCCC CCTGCTAGTT GAGCGGGACA GGGGACCCTG TGTTCTCACT 
GGGTCGCGGG GGACGATCAA CTCGCCCTGT CCCCTGGGAC ACAAGAGTGA 

26751 GTGATTTGCA ACTGTCCTAA CCCTGGATTA CATCAAGATC TTTGTTGCCA 
CACTAAACGT TGACAGGATT GGGACCTAAT GTAGTTCTAG AAACAACGGT 

26801 TCTCTGTGCT GAGTATAATA AATACAGAAA TTAAAATATA CTGGGGCTCC 
AGAGACACGA CTCATATTAT TTATGTCTTT AATTTTATAT GACCCCGAGG 

26851 TATCGCCATC CTGTAAACGC CACCGTCTTC ACCCGCCCAA GCAAACCAAG 
ATAGCGGTAG GACATTTGCG GTGGCAGAAG TGGGCGGGTT CGTTTGGTTC 

26901 GCGAACCTTA CCTGGTACTT TTAACATCTC TCCCTCTGTG ATTTACAACA 
CGCTTGGAAT GGACCATGAA AATTGTAGAG AGGGAGACAC TAAATGTTGT 

26951 GTTTCAACCC AGACGGAGTG AGTCTACGAG AGAACCTCTC CGAGCTCAGC 
CAAAGTTGGG TCTGCCTCAC TCAGATGCTC TCTTGGAGAG GCTCGAGTCG 

27001 TACTCCATCA GAAAAAACAC CACCCTCCTT ACCTGCCGGG AACGTACGAG 
ATGAGGTAGT CTTTTTTGTG GTGGGAGGAA TGGACGGCCC TTGCATGCTC 

27051 TGCGTCACCG GCCGCTGCAC CACACCTACC GCCTGACCGT AAACCAGACT 
ACGCAGTGGC CGGCGACGTG GTGTGGATGG CGGACTGGCA TTTGGTCTGA 

27101 TTTTCCGGAC AGACCTCAAT AACTCTGTTT ACCAGAACAG GAGGTGAGCT 
AAAAGGCCTG TCTGGAGTTA TTGAGACAAA TGGTCTTGTC CTCCACTCGA 

27151 TAGAAAACCC TTAGGGTATT AGGCCAAAGG CGCAGCTACT GTGGGGTTTA 
ATCTTTTGGG AATCCCATAA TCCGGTTTCC GCGTCGATGA CACCCCAAAT 

27201 TGAACAATTC AAGCAACTCT ACGGGCTATT CTAATTCAGG TTTCTCTAGA 
ACTTGTTAAG TTCGTTGAGA TGCCCGATAA GATTAAGTCC AAAGAGATCT 

27251 ATCGGGGTTG GG6TTATTCT CTGTCTTGTG ATTCTCTTTA TTCTTATACT 
TAGCCCCAAC CCCAATAAGA GACAGAACAC TAAGAGAAAT AAGAATATGA 

27301 AACGCTTCTC TGCCTAAGGC TCGCCGCCTG CTGTGTGCAC ATTTGCATTT 
TT6CGAAGAG ACGGATTCCG AGCGGCGGAC GACACACGTG TAAACGTAAA 

27351 ATTGTCA6CT TTTTAAACGC TGGGGTCGCC ACCCAAGATG ATTAGGTACA 
TAACAGTCGA AAAATTTGCG ACCCCAGCGG TCGGTTCTAC TAATCCATGT 

27401 TAATCCTAGG TTTACTCACC CTTGCGTCAG CCCACGGTAC CACCCAAAAG 
ATTAGGATCC AAATGAGTGG GAACGCAGTC GGGTGCCATG GTGGGTTTTC 
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TGAGTGCACC ACTCTTATAA AATGCACCAC AGAACATGAA AAGCTGCTTA 
ACTCACGTGG TGAGAATATT TTACGTGGTG TCTTGTACTT TTCGACGAAT 

TTCGCCACAA AAACAAAATT GGCAAGTATG CTGTTTATGC TATTTGGCAG 
AAGCGGTGTT TTTGTTTTAA CCGTTCATAC GACAAATACG ATAAACCGTC 

CCAGGTGACA CTACAGAGTA TAATGTTACA GTTTTCCAGG GTAAAAGTCA 
GGTCCACTGT GATGTCTCAT ATTACAATGT CAAAAGGTCC CATTTTCAGT 

TAAAACTTTT ATGTATACTT TTCCATTTTA TGAAATGTGC GACATTACCA 
ATTTTGAAAA TACATATGAA AAGGTAAAAT ACTTTACACG CTGTAATGGT 

TGTACATGAG CAAACAGTAT AAGTTGTGGC CCCCACAAAA TTGTGTGGAA 
ACATGTACTC GTTTGTCATA TTCAACACCG GGGGTGTTTT AACACACCTT 

AACACTGGCA CTTTCTGCTG CACTGCTATG CTAATTACAG TGCTCGCTTT 
TTGTGACCGT GAAAGACGAC GTGACGATAC GATTAATGTC ACGAGCGAAA 

GGTCTGTACC CTACTCTATA TTAAATACAA AAGCAGACGC AGCTTTATTG 
CCAGACATGG GATGAGATAT AATTTATGTT TTCGTCTGCG TCGAAATAAC 

AGGAAAAGAA AATGCCTTAA TTTACTAAGT TACAAAGCTA ATGTCACCAC 
TCCTTTTCTT TTACGGAATT AAATGATTCA ATGTTTCGAT TACAGTGGTG 

TAACTGCTTT ACTCGCTGCT TGCAAAACAA ATTCAAAAAG TTAGCATTAT 
ATTGACGAAA TGAGCGACGA ACGTTTTGTT TAAGTTTTTC AATCGTAATA 

AATTAGAATA GGATTTAAAC CCCCCGGTCA TTTCCTGCTC AATACCATTC 
TTAATCTTAT CCTAAATTTG GGGGGCCAGT AAAGGACGAG TTATGGTAAG 

CCCTGAACAA TTGACTCTAT GTGGGATATG CTCCAGCGCT ACAACCTTGA 
GGGACTTGTT AACTGAGATA CACCCTATAC GAGGTCGCGA TGTTGGAACT 

AGTCAGGCTT CCTGGATGTC AGCATCTGAC TTTGGCCAGC ACCTGTCCCG 
TCAGTCCGAA GGACCTACAG TCGTAGACTG AAACCGGTCG TGGACAGGGC 

CGGATTTGTT CCAGTCCAAC TACAGCGACC CACCCTAACA GAGATGACCA 
GCCTAAACAA GGTCAGGTTG ATGTCGCTGG GTGGGATTGT CTCTACTGGT 

ACACAACCAA CGCGGCCGCC GCTACCGGAC TTACATCTAC CACAAATACA 
TGTGTTGGTT GCGCCGGCG6 CGATGGCCTG AATGTAGATG 6TGTTTATGT 

CCCCAAGTTT CTGCCTTTGT CAATAACTGG GATAACTTGG GCATGTGGTG 
GGGGTTCAAA GACGGAAACA GTTATTGACC CTATTGAACC CGTACACCAC 

GTTCTCCATA GCGCTTATGT TTGTATGCCT TATTATTATG TGGCTCATCT 
CAAGAGGTAT CGCGAATACA AACATACGGA ATAATAATAC ACCGAGTAGA 

GCTGCCTAAA GCGCAAACGC GCCCGACCAC CCATCTATAG TCCCATCATT 
CGACGGATTT CGCGTTTGCG CGGGCTGGTG GGTAGATATC AGGGTAGTAA 

GTGCTACACC CAAACAATGA TGGAATCCAT AGATTGGACG GACTGAAACA 
CACGATGTGG GTTTGTTACT ACCTTAGGTA TCTAACCTGC CTGACTTTGT 
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2B451 TTTTATATTA CTGACCCTTG TTGCGCTTTT TTGTGCGTGC TCCACATTGG 
AAAATATAAT GACTGGGAAC AACGCGAAAA AACACGCACG AGGTGTAACC 

28 501 CTGCGGTTTC TCACATCGAA GTAGACTGCA TTCCAGCCTT CACAGTCTAT 
GACGCCAAAG AGTGTAGCTT CATCTGACGT AAGGTCGGAA GTGTCAGATA 

28551 TTGCTTTACG GATTTGTCAC CCTCACGCTC ATCTGCAGCC TCATCACT6T 
AACGAAATGC CTAAACAGTG GGAGTGCGAG TAGACGTCGG AGTAGTGACA 

28601 GGTCATCGCC TTTATCCAGT GCATTGACTG GGTCTGTGTG CGCTTT6CAT 
CCAGTAGCGG AAATAGGTCA CGTAACTGAC CCAGACACAC GCGAAACGTA 

28651 ATCTCAGACA CCATCCCCAG TACAGGGACA GGACTATAGC TGAGCTTCTT 
TAGAGTCTGT GGTAGGGGTC ATGTCCCTGT CCTGATATCG ACTCGAAGAA 

28701 AGAATTCTTT AATTATGAAA TTTACTGTGA CTTTTCTGCT GATTATTT6C 
TCTTAAGAAA TTAATACTTT AAATGACACT GAAAAGACGA CTAATAAACG 

28751 ACCCTATCTG CGTTTTGTTC CCCGACCTCC AAGCCTCAAA GACATATATC 
TGGGATAGAC GCAAAACAAG GGGCTGGAGG TTCGGAGTTT CTGTATATAG 

28801 ATGCAGATTC ACTCGTATAT GGAATATTCC AAGTTGCTAC AATGAAAAAA 
TACGTCTAAG TGAGCATATA CCTTATAAGG TTCAACGATG TTACTTTTTT 

28851 GCGATCTTTC CGAAGCCTGG TTATATGCAA TCATCTCTGT TATGGTGTTC 
CGCTAGAAAG GCTTCGGACC AATATACGTT AGTAGAGACA ATACCACAAG 

28901 TGCAGTACCA TCTTAGCCCT AGCTATATAT CCCTACCTTG ACATTGGCTG 
ACGTCATGGT AGAATCGGGA TCGATATATA GGGATGGAAC TGTAACCGAC 

28951 GAACGCAATA GATGCCATGA ACCACCCAAC TTTCCCCGCG CCCGCTATGC 
CTTGCGTTAT CTACGGTACT TGGTGGGTTG AAAGGGGCGC GGGCGATACG 

29001 TTCCACTGCA ACAAGTTGTT GCCGGCGGCT TTGTCCCAGC CAATCAGCCT 
AAGGTGACGT TGTTCAACAA CGGCCGCCGA AACAGGGTCG GTTAGTCGGA 

29051 CGCCCACCTT CTCCCACCCC CACTGAAATC AGCTACTTTA ATCTAACAGG 
GCGGGTGGAA GAGGGTGGGG GTGACTTTAG TCGATGAAAT TAGATTGTCC 

29101 AGGAGATGAC TGACACCCTA GATCTAGAAA TGGACGGAAT TATTACAGAG 
TCCTCTACTG ACTGTGGGAT CTAGATCTTT ACCTGCCTTA ATAATGTCTC 

29151 CAGCGCCTGC TAGAAAGACG CAGGjSCAGCG GCCGAGCAAC AGCGCATGAA 
GTCGCGGACG ATCTTTCTGC GTCCCGTCGC CGGCTCGTTG TCGCGTACTT 

29201 TCAAGAGCTC CAAGACATGG TTAACTTGCA CCAGTGCAAA AGGGGTATCT 
AGTTCTCGAG GTTCTGTACC AATTGAACGT GGTCACGTTT TCCCCATAGA 

29251 TTTGTCTCGT AAAGCAGGCC AAA6TCACCT ACGACAGTAA TACCACCGGA 
AAACAGAGCA TTTCGTCCGG TTTCAGTGGA TGCT6TCATT ATGGTGGCCT 

29301 CACCGCCTTA GCTACAAGTT GCCAACCAAG CGTCAGAAAT TGGTGGTCAT 
GTGGCGGAAT CGATGTTCAA CGGTTGGTTC GCAGTCTTTA ACCACCAGTA 
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29401 GCTGCATTCA CTCACCTTGT CAAGGACCTG AGGATCTCTG CACCCTTATT 
CGACGTAAGT GAGTGGAACA GTTCCTGGAC TCCTAGAGAC GTGGGAATAA 

29451 AAGACCCTGT GCGGTCTCAA AGATCTTATT CCCTTTAACT AATAAAAAAA 
TTCTGGGACA CGCCAGAGTT TCTAGAATAA GGGAAATTGA TTATTTTTTT 

29501 AATAATAAAG CATCACTTAC TTAAAATCAG TTAGCAAATT TCTGTC.CAGT 
TTATTATTTC GTAGTGAATG AATTTTAGTC AATCGTTTAA AGACAGGTCA 

29551 TTATTCAGCA GCACCTCCTT GCCCTCCTCC CAGCTCTGGT ATTGCAGCTT 
AATAAGTCGT CGTGGAGGAA CGGGAGGAGG GTCGAGACCA TAACGTCGAA 

29601 CCTCCTGGCT GCAAACTTTC TCCACAATCT AAATGGAATG TCAGTTTCCT 
GGAGGACCGA CGTTTGAAAG AGGTGTTAGA TTTACCTTAC AGTCAAAGGA 

29651 CCTGTTCCTG TCCATCCGCA CCCACTATCT TCATGTTGTT GCAGATGAAG 
GGACAAGGAC AGGTAGGCGT GGGTGATAGA AGTACAACAA CGTCTACTTC 

29701 CGCGCAAGAC CGTCTGAAGA TACCTTCAAC CCCGTGTATC CATATGACAC 
GCGCGTTCTG GCAGACTTCT ATGGAAGTTG GGGCACATAG GTATACTGTG 

29751 GGAAACCGGT CCTCCAACTG TGCCTTTTCT TACTCCTCCC TTTGTATCCC 
CCTTTGGCCA GGAGGTTGAC ACGGAAAAGA ATGAGGAGGG AAACATAGGG 

29801 CCAATGGGTT TCAAGAGAGT CCCCCTGGGG TACTCTCTTT GCGCCTATCC 
GGTTACCCAA AGTTCTCTCA GGGGGACCCC ATGAGAGAAA CGCGGATAGG 

29851 GAACCTCTAG TTACCTCCAA TGGCATGCTT GCGCTCAAAA TGGGCAACGG 
CTTGGAGATC AATGGAGGTT ACCGTACGAA CGCGAGTTTT ACCCGTTGCC 

29901 CCTCTCTCTG GACGAGGCCG GCAACCTTAC CTCCCAAAAT GTAACCACTG 
GGAGAGAGAC CTGCTCCGGC CGTTGGAATG GAGGGTTTTA CATTGGTGAC 

29951 TGAGCCCACC TCTCAAAAAA ACCAAGTCAA ACATAAACCT GGAAATATCT 
ACTCGGGTGG AGAGTTTTTT TGGTTCAGTT TGTATTTGGA CCTTTATAGA 

30001 GCACCCCTCA CAGTTACCTC AGAAGCCCTA ACTGTGGCTG CCGCCGCACC 
CGTGGGGAGT GTCAATGGAG TCTTCGGGAT TGACACCGAC GGCGGCGTGG 

30051 TCTAATGGTC GCGGGCAACA CACTCACCAT GCAATCACAG GCCCCGCTAA 
AGATTACCAG CGCCCGTTGT GTGAGTGGTA CGTTAGTGTC CGGGGCGATT 

30101 CCGTGCACGA CTCCAAACTT AGCATTGCCA CCCAAGGACC CCTCACAGTG 
GGCACGTGCT GAGGTTTGAA TCGTAACGGT GGGTTCCTGG GGAGT6TCAC 

30151 TCAGAAGGAA AGCTAGCCCT GCAAACATCA GGCCCCCTCA CCACCACCGA 
AGTCTTCCTT TCGATCGGGA CGTTTGTAGT CCGGGGGAGT GGTGGTGGCT 

30201 TAGCAGTACC CTTACTATCA CTGCCTCACC CCCTCTAACT ACTGCCACTG 
ATCGTCATGG GAATGATAGT GACGGA6TGG GGGAGATTGA TGACGGT6AC 

30251 GTAGCTTGGG CATTGACTTG AAAGAGCCCA TTTATACACA AAATGGAAAA 
CATCGAACCC GTAACTGAAC TTTCTCGGGT AAATATGTGT TTTACCTTTT 
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30351 TTTGACCGTA GCAACTGGTC CAGGTGTGAC TATTAATAAT ACTTCCTTGC 
AAACTGGCAT CGTTGACCAG GTCCACACTG ATAATTATTA TGAAGGAACG 

30401 AAACTAAAGT TACTGGAGCC TTGGGTTTTG ATTCACAAGG CAATATGCAA 
TTTGATTTCA ATGACCTCGG AACCCAAAAC TAAGTGTTCC GTTATACGTT 

30451 CTTAATGTAG CAGGAGGACT AAGGATTGAT TCTCAAAACA GACGCCTTAT 
GAATTACATC GTCCTCCTGA TTCCTAACTA AGAGTTTTGT CTGCGGAATA 

30501 ACTTGATGTT AGTTATCCGT TT6ATGCTCA AAACCAACTA AATCTAAGAC 
TGAACTACAA TCAATAGGCA AACTACGAGT TTrGGTTGAT TTAGATTCTG 

30551 TAGGACAGGG CCCTCTTTTT ATAAACTCAG CCCACAACTT GGATATTAAC 
ATCCTGTCCC GGGAGAAAAA TATTTGAGTC GGGTGTTGAA CCTATAATTG 

30601 TACAACAAAG GCCTTTACTT GTTTACAGCT TCAAACAATT CCAAAAAGCT 
ATGTTGTTTC CGGAAATGAA CAAATGTCGA AGTTTGTTAA GGTTTTTCGA 

30651 TGAGGTTAAC CTAAGCACTG CCAAGGGGTT GATGTTTGAC GCTACAGCCA 
ACTCCAATTG GATTCGTGAC GGTTCCCCAA CTACAAACTG CGATGTCGGT 

30701 TAGCCATTAA TGCAGGAGAT GGGCTTGAAT TTGGTTCACC TAATGCACCA 
ATCGGTAATT ACGTCCTCTA CCCGAACTTA AACCAAGTGG ATTACGTGGT 

30751 AACACAAATC CCCTCAAAAC AAAAATTGGC CATGGCCTAG AATTTGATTC 
TTGTGTTTAG GGGAGTTTTG TTTTTAACCG GTACCGGATC TTAAACTAAG 

30801 AAACAAGGCT ATGGTTCCTA AACTAGGAAC TGGCCTTAGT TTTGACAGCA 
TTTGTTCCGA TACCAAGGAT TTGATCCTTG ACCGGAATCA AAACTGTCGT 

30851 CAGGTGCCAT TACAGTAGGA AACAAAAATA ATGATAAGCT AACTTTGTGG 
GTCCACGGTA ATGTCATCCT TTGTTTTTAT TACTATTCGA TTGAAACACC 

30901 ACCACACCAG CTCCATCTCC TAACTGTAGA CTAAATGCAG AGAAAGATGC 
TGGTGTGGTC GAGGTAGAGG ATTGACATCT GATTTACGTC TCTTTCTACG 

30951 TAAACTCACT TTGGTCTTAA CAAAATGTGG CAGTCAAATA CTTGCTACAG 
ATTTGAGTGA AACCAGAATT GTTTTACACC GTCAGTTTAT GAACGATGTC 

31001 TTTCAGTTTT GGCTGTTAAA GGCAGTTTGG CTCCAATATC TGGAACAGTT 
AAAGTCAAAA CCGACAATTT CCGTCAAACC GAGGTTATAG ACCTTGtCAA 

31051 CAAAGTGCTC ATCTTATTAT AAGATTTGAC GAAAATGGAG TGCTACTAAA 
GTTTCACGAG TAGAATAATA TTCTAAACTG CTTTTACCTC ACGATGATTT 

31101 CAATTCCTTC CTGGACCCAG AATATTGGAA CTTTAGAAAT GGAGATCTTA 
GTTAAGGAAG GACCTGGGTC TTATAACCTT GAAATCTTTA CCTCTAGAAT 

31151 CTGAAGGCAC AGCCTATACA AACGCTGTTG GATTTATGCC TAACCTATCA 
GACTTCCGTG TCGGATATGT TTGCGACAAC CTAAATACGG ATTGGATAGT 

31201 GCTTATCCAA AATCTCACGG TAAAACTGCC AAAAGTAACA TTGTCAGTCA 
CGAATAGGTT TTAGAGTGCC ATTTTGACGG TTTTCATTGT AACAGTCAGT 
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AGTTTACTTA AACGGAGACA AAACTAAACC TGTAACACTA ACCATTACAC 
TCAAATGAAT TTGCCTCTGT TTTGATTTGG ACATTGTGAT TGGTAATGTG 

TAAACGGTAC ACAGGAAACA GGAGACACAA CTCCAAGTGC ATACTCTATG 
ATTTGCCATG TGTCCrrTGT CCTCTGTGTT GAGGTTCACG TATGAGATAC 

TCATTTTCAT GGGACTGGTC TGGCCACAAC TACATTAATG AAATATTTGC 
AGTAAAAGTA CCCTGACCAG ACCGGTGTTG ATGTAATTAC TTTATAAACG 

CACATCCTCT TACACTTTTT CATACATTGC CCAAGAATAA AGAATCGTTT 
GTGTAGGAGA ATGTGAAAAA 6TATGTAACG GGTTCTTATT TCTTAGCAAA 

GTGTTATGTT TCAACGTGTT TATTTTTCAA TTGCAGAAAA TTTCAAGTCA 
CACAATACAA AGTTGCACAA ATAAAAAGTT AACGTCTTTT AAAGTTCAGT 

TTTTTCATTC AGTAGTATAG CCCCACCACC ACATAGCTTA TACAGATCAC 
AAAAAGTAAG TCATCATATC OGGGTGGTGG TGTATCGAAT ATGTCTAGTG 

CGTACCTTAA TCAAACTCAC AGAACCCTAG TATTCAACCT GCCACCTCCC 
GCATGGAATT AGTTTGAGTG TCTTGGGATC ATAAGTTGGA CGGTGGAGGG 

TCCCAACACA CAGAGTACAC AGTCCTTTCT CCCCGGCTGG CCTTAAAAAG 
AGGGTTGTGT GTCTCATGTG TCAGGAAAGA GGGGCCGACC GGAATTTTTC 

CATCATATCA TGGGTAACAG ACATATTCTT AGGTGTTATA TTCCACACGG 
GTAGTATAGT ACCCATTGTC TGTATAAGAA TCCACAATAT AAGGTGTGCC 

TTTCCTGTCG AGCCAAACGC TCATCAGTGA TATTAATAAA CTCCCCGGGC 
AAAGGACAGC TCGGTTTGCG AGTAGTCACT ATAATTATTT GAGGGGCCCG 

AGCTCACTTA AGTTCATGTC GCTGTCCAGC TGCTGAGCCA CAGGCTGCTG 
TCGAGTGAAT TCAAGTACAG CGACAGGTCG ACGACTCGGT GTCCGACGAC 

TCCAACTTGC GGTTGCTTAA CGGGCGGCGA AGGAGAAGTC CACGCCTACA 
AGGTTGAACG CCAACGAATT GCCCGCCGCT TCCTCTTCAG GTGCGGATGT 

TGGGGGTAGA GTCATAATCG TGCATCAGGA TAGGGCGGTG GTGCTGCAGC 
ACCCCCATCT CAGTATTAGC ACGTAGTCCT ATCCCGCCAC CACGACGTCG 

AGCGCGCGAA TAAACTGCTG CCGCCGCCGC TCCGTCCTGC AGGAATACAA 
TCGCGCGCTT ATTTGACGAC GGCGGCGGCG AGGCAGGACG TCCTTATGTT 

CATGGCAGTG GTCTCCTCAG CGATGATTCG CACCGCCCGC AGCATAAGGC 
GTACCGTCAC CAGAGGAGTC GCTACTAAGC GTGGCGGGCG TCGTATTCCG 

GCCTTGTCCT CCGGGCACAG CAGCGCACCC T6ATCTCACT TAAATCAGCA 
CGGAACAG6A GGCCCGTGTC GTCGCGTGGG ACTAGAGTGA ATTTAGTCGT 

CAGTAACTGC AGCACAGCAC CACAATATTG TTCAAAATCC CACAGTGCAA 
GTCATTGACG TCGTGTCGTG GT6TTATAAC AAGTTTTAGG GTGTCACGTT 

GGCGCTGTAT CCAAAGCTCA TGGCGGGGAC CACAGAACCC ACGTGGCCAT 
CCGCGACATA GGTTTCGAGT ACCGCCCCTG GTGTCTTGGG TGCACCGGTA 

CATACCACAA GCGCAGGTAG ATTAAGTGGC GACCCCTCAT AAACACGCTG 
GTATGGTGTT CGCGTCCATC TAATTCACCG CTGGGGAGTA. TTTGTGCGAC 
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32251 CCATATAAAC CTCTGATTAA ACATGGCGCC ATCCACCACC ATCCTAAACC 
GGTATATTTG GAGACTAATT TGTACCGCGG TAGGTGGTGG TAGGATTTGG 

32301 AGCTGGCCAA AACCTGCCCG CCGGCTATAC ACTGCAGGGA ACCGGGACTG 
TCGACCGGTT TTGGACGGGC GGCCGATATG TGACGTCCCT TGGCCCTGAC 

32351 GAACAATGAC AGTGGAGAGC CCAGGACTCG TAACCATGGA TCATCAT6CT 
CTTGTTACTG TCACCTCTCG GGTCCTGAGC ATTGGTACCT AGTAGTACGA 

32401 CGTCATGATA TCAATGTTGG CACAACACAG GCACACGTGC ATACACTTCC 
GCAGTACTAT AGTTACAACC GTGTTGTGTC CGTGTGCACG TATGTGAAGG 

32451 TCAGGATTAC AAGCTCCTCC CGCGTTAGAA CCATATCCCA GGGAACAACC 
AGTCCTAATG TTCGAGGAGG 6CGCAATCTT GGTATAGGGT CCCTTGTTGG 

32501 CATTCCTGAA TCAGC6TAAA TCCCACACTG CAGGGAAGAC CTCGCACGTA 
GTAAGGACTT AGTCGCATTT AGGGTGTGAC GTCCCTTCTG GAGCGTGCAT 

32551 ACTCACGTTG TGCATTGTCA AAGTGTTACA TTCGGGCAGC AGCGGATGAT 
TGAGTGCAAC ACGTAACAGT TTCACAATGT AAGCCCGTCG TCGCCTACTA 

32601 CCTCCAGTAT GGTAGCGCGG GTTTCTGTCT CAAAAGGAGG TAGACGATCC 
GGAGGTCATA CCATCGCGCC CAAAGACAGA GTTTTCCTCC ATCTGCTAGG 

32651 CTACTGTACG GAGTGCGCCG AGACAACCGA GATCGTGTTG GTCGTAGTGT 
GATGACATGC CTCACGCGGC TCTGTTGGCT CTAGCACAAC CA6CATCACA 

32701 CATGCCAAAT GGAACGCCGG ACGTAGTCAT ATTTCCTGAA GCAAAACCAG 
GTACGGTTTA CCTTGCGGCC TGCATCAGTA TAAAGGACTT CGTTTTGGTC 

32751 GTGCGGGCG7 GACAAACAGA TCTGCGTCTC CGGTCTCGCC GCTTAGATCG 
CACGCCCGCA CTGTTTGTCT AGACGCAGAG GCCAGAGCGG CGAATCTAGC 

32801 CTCTGTGTAG TAGTTGTAGT ATATCCACTC TCTCAAAGCA TCCAGGCGCC 
GAGACACATC ATCAACATCA TATAGGTGAG AGAGTTTCGT AGGTCCGCGG 

32851 CCCTGGCTTC GGGTTCTATG TAAACTCCTT CATGCGCCGC TGCCCTGATA 
GGGACCGAAG CCCAAGATAC ATTTGAGGAA 6TACGCGGCG AC66GACTAT 

32901 ACATCCACCA CCGCAGAATA AGCCACACCC AGCCAACCTA CACATTCGTT 
TGTAGGTGGT GGCGTCTTAT TCGGTGTGGG TCGGTTGGAT GTGTAAGCAA 

32951 CTGCGAGTCA CACACGGGAG 6AGCGGGAAG AGCTGGAAGA ACCATGTTTT 
GACGCTCAGT GTGTGCCCTC CTCGCCCTTC TCGACCTTCT TGGTACAAAA 

33001 TTTTTTTATT CCAAAAGATT ATCCAAAACC TCAAAATGAA GATCTATTAA 
AAAAAAATAA GGTTTTCTAA TAGGTTTTG6 AGTTTTACTT CTAGATAATT 

33051 GTGAACGCGC TCCCCTCC6G TGGCGTGGTC AAACTCTACA GCCAAA6AAC 
CACTTGCGC6 AGGGGAGGCC ACCGCACCAG TTTGAGATGT CGGTTTCTTG 

33101 AGATAATGGC ATTTGTAAGA TGTTGCACAA TGGCTTCCAA AAGGCAAACG 
TCTATTACCG TAAACATTCT ACAACGTGTT ACC6AAGGTT TTCCGTTTGC 
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33201 CTCTATAAAC ATTCCAGCAC CTTCAACCAT GCCCAAATAA TTCTCATCTC 
GAGATATTTG TAAGGTCGTG GAAGTTGGTA CGGGTTTATT AAGAGTAGAG 

33251 GCCACCTTCT CAATATATCT CTAAGCAAAT CCCGAATATT AAGTCCGGCC 
CGGTGGAAGA GTTATATAGA GATTCGTTTA GGGCTTATAA TTCAGGCCGG 

33301 ATTGTAAAAA TCTGCTCCAG AGCGCCCTCC ACCTTCAGCC TCAAGCAGCG 
TAACATTTTT AGACGAGGTC TCGCGGGAGG TGGAAGTCGG AGTTCGTCGC 

33351 AATCATGATT GCAAAAATTC AGGTTCCTCA CAGACCTGTA TAAGATTCAA 
TTAGTACTAA CGTTTTTAAG TCCAAGGAGT GTCTGGACAT ATTCTAAGTT 

33401 AAGCGGAACA TTAACAAAAA TACCGCGATC CCGTAGGTCC CTTCGCAGGG 
TTCGCCTTGT AATTGTTTTT ATGGCGCTAG GGCATCCAGG GAAGCGTCCC 

33451 CCAGCTGAAC ATAATCGTGC AGGTCTGCAC GGACCAGCGC GGCCACTTCC 
GGTCGACTTG TATTAGCACG TCCAGACGTG CCTGGTCGCG CCGGTGAAGG 

33501 CCGCCAGGAA CCATGACAAA AGAACCCACA CTGATTATGA CACGCATACT 
GGCGGTCCTT GGTACTGTTT TCTTGGGTGT GACTAATACT GTGCGTATGA 

33551 CGGAGCTATG CTAACCAGCG TAGCCCCGAT GTAAGCTTGT TGCATGGGCG 
GCCTCGATAC GATTGGTCGC ATCGGGGCTA CATTCGAACA ACGTACCCGC 

33601 GCGATATAAA ATGCAAGGTG CTGCTCAAAA AATCAGGCAA AGCCTCGCGC 
CGCTATATTT TACGTTCCAC GACGAGTTTT TTAGTCCGTT TCGGAGCGCG 

33651 AAAAAAGAAA GCACATCGTA GTCATGCTCA TGCAGATAAA GGCAGGTAAG 
TTTTTTCTTT CGTGTAGCAT CAGTACGAGT ACGTCTATTT CCGTCCATTC 

337 01 CTCCGGAACC ACCACAGAAA AAGACACCAT TTTTCTCTCA AACATGTCTG 
GAGGCCTTGG TGGTGTCTTT TTCTGTGGTA AAAAGAGAGT TTGTACAGAC 

33751 CGGGTTTCTG CATAAACACA AAATAAAATA ACAAAAAAAC ATTTAAACAT 
GCCCAAAGAC GTATTTGTGT TTTATTTTAT TGTTTTTTTG TAAATTTGTA 

33801 TAGAAGCCTG TCTTACAACA GGAAAAACAA CCCTTATAAG CATAAGACGG 
ATCTTCGGAC AGAATGTTGT CCTTTTTGTT GGGAATATTC GTATTCTGCC 

33851 ACTACGGCCA TGCCGGCGTG ACCGTAAAAA AACTGGTCAC CGTGATTAAA 
TGATGCCGGT ACGGCCGCAC TGGCATTTTT TTGACCAGTG GCACTAATTT 

33901 AAGCACCACC GACAGCTCCT CGGTCATGTC CGGAGTCATA ATGTAAGACT 
TTCGTGGTGG CTGTCGAGGA GCCAGTACAG GCCTCAGTAT TACATTCTGA 

33951 CGGTAAACAC ATCAGGTTGA TTCACATCGG TCAGTGCTAA AAAGCGACCG 
GCCATTTGTG TAGTCCAACT . AAGTGTAGCC AGTCACGATT TTTCGCTGGC 

34001 AAATAGCCCG GGGGAATACA TACCCGCAGG CGTAGAGACA ACATTACAGC 
TTTATCGGGC CCCCTTATGT ATGGGC6TCC GCATCTCTGT TGTAATGTCG 

34051 CCCCATAGGA GGTATAACAA AATTAATAGG AGAGAAAAAC ACATAAACAC 
GGGGTATCCT CCATATTGTT TTAATTATCC TCTCTTTTTG TGTATTTGTG 
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34151 
34202 
34251 
34301 
34351 
34401 
34451 
34501 

34551 
34601 
34651 
34701 
34751 
34B01 
34651 
34901 
34951 
35001 



ACATACAGCG CTTCCACAGC GGCAGCCATA ACAGTCAGCC TTACCAGTAA 
TGTATGTCGC GAAGGTGTCG CCGTCGGTAT TGTCAGTCGG AATGGTCATT 

AAAAGAAAAC CTATTAAAAA AACACCACTC GACACGGCAC CAGCTCAATC 
TTTTCTTTTG GATAATTTTT TTGTGGTGAG CTGTGCCGTG GTCGAGTTAG 

AGTCACAGTG TAAAAAAG6G CCAAGTGCAG AGCGAGTATA TATAGGACTA 
TCAGTGTCAC ATTTTTTCCC GGTTCACGTC TCGCTCATAT ATATCCTGAT 

AAAAATGACG TAACGGTTAA AGTCCACAAA AAACACCCAG AAAACCGCAC 
TTTTTACTGC ArPGCCAATT TCAGGTGTTT TTTGTGGGTC TTTTGGCGTG 

GCGAACCTAC GCCCAGAAAC GAAAGCCAAA AAACCCACAA CTTCCTCAAA 
CGCTTGGATG CGGGTCTTTG CTTTCGGTTT TTTGGGTGTT GAAGGAGTTT 

TCGTCACTTC CGTTTTCCCA CGTTACGTCA CTTCCCATTT TAAGAAAACT 
AGCAGTGAAG GCAAAAGGGT 6CAATGCAGT GAAGGGTAAA ATTCTTTTGA 

ACAATTCCCA ACACATACAA GTTACTCCGC CCTAAAACCT ACGTCACCCG 
T6TTAAGGGT TGTGTATGTT CAATGAGGCG GGATTTTGGA TGCAGTGGGC 

CCCCGTTCCC ACGCCCCGCG CCACGTCACA AACTCCACCC CCTCATTATC 
GGGGCAAGGG TGCGGGGCGC GGTGCAGTGT TTGAGGTGGG GGAGTAATAG 



ATATTGGCTT CAATCCAAAA TAAGGTATAT TATTGATGAT GTTAATTAAG 
TATAACCGAA GTTAGGTTTT ATTCCATATA ATAACTACTA CAATTAATTC 

AATTCGGATC TGCGACGCGA GGCTGGATGG CCTTCCCCAT TATGATTCTT 
TTAAGCCTAG ACGCTGCGCT CCGACCTACC GGAAGGGGTA ATACTAAGAA 

CTCGCTTCCG GCGGCATCGG 6ATGCCCGCG TTGCAGGCCA TGCTGTCCAG 
GAGCGAAGGC CGCCGTAGCC CTACGGGCGC AACGTCCGGT ACGACAGGTC 

GCAGGTAGAT GACGACCATC AGGGACAGCT TCAAGGCCAG CAAAAGGCCA 
CGTCCATCTA CTGCTGGTAG TCCCTGTCGA AGTTCCGGTC GTTTTCCGGT 

GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG GCTCCGCCCC 
CCTTGGCATT TTTCCGGCGC AACGACCGCA AAAA6GTATC CGAGGCGGGG 

CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC 
GGACTGCTCG TAGTGTTTTT AGCTGCGAGT TCAGTCTCCA CCGCTTTGGG 

GACAGGACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC 
CTGTCCTGAT ATTTCTATGG TCCGCAAAGG GGGACCTTCG A6GGAGCACG 

GCTCTCCTGT TCCGACCCTG CCGCTTACCG GATACCTGTC CGCCTTTCTC 
CGAGAGGACA AGGCTGGGAC GGCGAATGGC CTATGGACAG GCGGAAAGAG 

CCTTCGGGAA GCGTGGCGCT TTCTCATAGC TCACGCTGTA GGTATCTCAG 
GGAAGCCCTT CGCACCGCGA AAGAGTATCG AGTGCGACAT CCATAGAGTC 

TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC GAACCCCCCG 
AAGCCACATC CAGCAAGCGA GGTTCGACCC GACACACGTG CTTGGGGGGC 
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35101 
35151 
35201 
35251 
35301 
35351 
35401 
35451 
35501 
35551 
35601 
35651 
35701 
35751 
35801 
35B51 
35901 
35951 



AAGTCGGGCT GGCGACGCGG AATAG6CCAT TGATAGCA6A ACTCAGGTTG 

CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT 
GGCCATTCTG TGCTGAATAG CGGTGACCGT CGTCGGTGAC CATTGTCCTA 

TAGCAGAGCG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC 
ATCGTCTCGC TCCATACATC CGCCACGATG TCTCAAGAAC TTCACCACCG 

CTAACTACGG CTACACTAGA AGGACAGTAT TTGGTATCTG CGCTCTGCTG 
6ATTGATGCC GATGTGATCT TCCTGTCATA AACCATAGAC GCGAGACGAC 

AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT CCGGCAAACA 
TTCGGTCAAT GGAAGCCTTT TTCTCAACCA TCGAGAACTA GGCCGTTTGT 

AACCACCGCT GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG CAGATTACGC 
TTGGTGGCGA CCATCGCCAC CAAAAAAACA AACGTTCGTC GTCTAATGCG 

GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC TACGGGGTCT 
CGTCTTTTTT TCCTAGAGTT CTTCTAGGAA ACTAGAAAAG ATGCCCCAGA 

GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT 
CTGCGAGTCA CCTTGCTTTT GAGTGCAATT CCCTAAAACC AGTACTCTAA 

ATCAAAAAGG ATCTTCACCT AGATCCTTTT AAATCAATCT AAAGTATATA 
TAGTTTTTCC TAGAAGTGGA TCTAGGAAAA TTTAGTTAGA TTTCATATAT 

TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA 
ACTCATTTGA ACCAGACTGT CAATGGTTAC GAATTAGTCA CTCCGTGGAT 

TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG ACTCCCCGTC 
AGAGTCGCTA GACAGATAAA GCAAGTAGGT ATCAACGGAC TGAGGGGCAG 

GTGTAGATAA CTACGATACG GGAGGGCTTA CCATCT6GCC CCAGTGCTGC 
CACATCTATT GATGCTATGC CCTCCCGAAT GGTAGACCGG GGTCACGACG 

AATGATACCG CGAGACCCAC GCTCACCGGC TCCAGATTTA TCAGCAATAA 
TTACTATGGC GCTCTGGGTG CGAGTGGCCG AGGTCTAAAT AGTCGTTATT 

ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA GTGGTCCTGC AACTTTATCC 
TGGTCGGTCG GCCTTCCCGG CTCGCGTCTT CACCAGGACG TTGAAATAGG 

GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG TAAGTAGTTC 
CG6AGGTAGG TCAGATAATT AACAACGGCC CTTCGATCTC ATTCATCAAG 

GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA GGCATCGTGG 
CGGTCAATTA TCAAACGCGT TGCAACAACG GTAACGATGT CCGTAGCACC 

TGTCACGCTC GTCGTTTGGT ATGGCTTCAT TCAGCTCCGG TTCCCAACGA 
ACAGTGCGAG CAGCAAACCA TACCGAAGTA AGTCGAGGCC AAGGGTTGCT 

TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC 
A6TTCCGCTC AATGTACTAG GGGGTACAAC ACGTTTTTTC GCCAATC6AG 

CTTCGGTCCT CCGATCGTTG TCAGAAGTAA GTTGGCCGCA GTGTTATCAC 
GAAGCCAGGA GGCTAGCAAC A6TCTTCATT CAACCGGCGT CACAATAGTG 
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36051 AGATGCTTTT CTGTGACTGG TGAGTACTCA ACCAAGTCAT TCTGAGAATA 
TCTACGAAAA GACACTGACC ACTCATGAGT TGGTTCAGTA AGACTCTTAT 

36101 GTGTATGCGG CGACCGAGTT GCTCTTGCCC GGCGTCAACA CGGGATAATA 
CACATACGCC GCTGGCTCAA CGAGAACGGG CCGCAGTTGT GCCCTATTAT 

36151 CCGCGCCACA TAGCAGAACT TTAAAAGTGC TCATCATTGG AAAACGTTCT 
GGCGCGGTGT ATCGTCTTGA AATTTTCACG AGTAGTAACC TTTTGCAAGA 

36201 TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTTGAGAT CCAGTTCGAT 
AGCCCCGCTT TTGAGAGTTC CTAGAATGGC GACAACTCTA GGTCAAGCTA 

36251 GTAACCCACT CGTGCACCCA ACTGATCTTC AGCATCTTTT ACTTTCACCA 
CATTGGGTGA GCACGTGGGT TGACTAGAAG TCGTAGAAAA TGAAAGTGGT 

36301 GCGTTTCTCG GTGAGCAAAA ACAGGAAGGC AAAATGCC6C AAAAAAGGGA. 

CGCAAAGACC CACTCGTTTT TGTCCTTCCG TTTTACGGCG TTTTTTCCCf 

36351 ATAAGGGCGA CACGGAAATG TTGAATACTC ATACTCTTCC TTTTTCAATA 
TATTCCCGCT GTGCCTTTAC AACTTATGAG TATGAGAAGG AAAAAGTTAT 

36401 TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGAGCGGA TACATATTTG 
AATAACTTCG TAAATAGTCC CAATAACAGA GTACTCGCCT ATGTATAAAC 

36451 AATGTATTTA GAAAAATAAA CAAATAGGGG TTCCGCGCAC ATTTCCCC'GA 
TTACATAAAT CTTTTTATTT GTTTATCCCC AAGGCGCGTG TAAAGGGGCT 

36501 AAAGTGCCAC CTGACGTCTA AGAAACCATT ATTATCATGA CATTAACCTA 
TTTCACGGTG GACTGCAGAT TCTTTGGTAA TAATAGTACT GTAATTGGAT 

36551 TAAAAATAGG CGTATCACGA GGCCCTTTCG TCTTCAAGAA TTGGATCCGA 
ATTTTTATCC GCATAGTGCT CCGGGAAAGC AGAAGTTCTT AACCTAGGCT 

Pad 



36601 ATTCTTAATT TCTTAATTAA (SEQ ID NO: 34) 
TAAGAATTAA AGAATTAATT' (SEQ ID NO: 35) 
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VIRUS (P6 ) PLASMID VIRUS (P21) 
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AEX*HPLC Cell-AssQCiaied Viral Particte Concentration 
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atg gat gca atg aag aga ggg etc tgc tgt gtg ctg ctg ctg tgt gga 48 
Met Asp Ala Met hys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 

gca gtc ttc gtt teg ccc age gag ate tec ett gtg tgg gee tec agg 96 
Ala Val Phe Val Ser Pro Ser Glu lie Ser lie Val Trp Ala Ser Arg 
20 25 30 

gag ctg gag agg ttt get gtg aac cct ggc ctg ctg gag acc tct gag 144 
Glu Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser Glu 
35 40 45 

ggg tgc agg cag ate ctg ggc eag etc cag ccc tec ctg caa aca ggc 192 
Gly Cys Arg Gin lie Leu Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly, 
50 55 60 

tct gag gag ctg agg tec ctg tac aac aca gtg get ace ctg tac tgt 240 
Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu Tyr Cys 
65 70 75 80 

gtg cac cag aag att gat gtg aag gac acc aag gag gee ctg gag aag 288 
Val His Gin Lys lie Asp Val Lys Asp Thr Lys Glu Ala Leu Glu Lys 
85 90 95 

att gag gag gag cag aac aag tec aag aag aag gee cag cag get get 336 
lie Glu Glu Glu Gin Asn Lys Ser Lys Lys Lys Ala Gin Gin Ala Ala 
100 105 110 

get ggc aca ggc aac tee age cag gtg tec cag aac tac ccc att gtg 384 
Ala Gly Thr Gly Asn Ser Ser Gin Val Ser Gin Asn Tyr Pro lie Val 
115 120 125 

eag aac etc cag ggc eag atg gtg cac cag gee ate tee ccc egg ace 432 
Gin Asn Leu Gin Gly Gin Met Val His Gin Alia He Ser Pro Arg Thr 
130 135 140 

ctg aat gee tgg gtg aag gtg gtg gag gag aag gee ttc tee cct gag 480 
Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu 
145 150 155 160 

gtg ate ccc atg ttc tct gee ctg tct gag ggt gee acc ccc eag gac 528 
Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp 
165 170 175 

ctg aac acc atg ctg aac aca gtg ggg ggc cat cag get gee atg cag 576 
Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met Gin 
180 185 190 

atg ctg aag gag acc ate aat gag gag get get gag tgg gac agg ctg 624 
Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg Leu 
195 200 205 

cat cct gtg cac get ggc ccc att gee ccc ggc cag atg agg gag ccc 672 
His Pro Val His Ala Gly Pro He Ala Pro Gly 61n Met Arg Glu Pro 
210 215 220 

agg ggc tct gac att get ggc acc acc tec acc etc cag gag cag att 720 
Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin He 
225 230 235 240 

ggc tgg atg acc aac aac ccc ccc ate cct gtg ggg gaa ate tec aag 768 
Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr Lys 
245 250 255 
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agg tgg etc ate ctg ggc ctg aac aag att gtg agg atg tac tec ccc 816 
Arg Trp He He Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser Pro 
260 265 270 

acc tec ate ctg gac ate agg cag ggc ccc aag gag ccc ttc agg gac 864 
Thr Ser lie Leu Aap He Arg Gin Gly Fro Lys Glu Pro Phe Arg Asp 
275 280 285 

tat gtg gac agg ttc tac aag acc ctg agg get gag cag gcc tec cag 912 
lyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Ser Gin 
290 295 300 

gag gtg aag aac tgg atg aca gag acc ctg ctg gtg cag aat gcc aac 960 
Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn 
305 310 315 320 

cet gac tgc aag acc ate ctg aag gcc ctg ggc cct get gcc acc ctg 1008 
Pro Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu 
325 330 335 

gag gag atg atg aca gcc tgc cag ggg gtg ggg ggc cct ggt cac aag 1056 
Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys 
340 345 350 

gcc agg gtg ctg get gag gcc atg tec cag gtg acc aac tec gcc acc 1104 
Ala Arg Val Leu Ala Glu Ala Met Ser Gin Val Thr Asn Ser Ala Thr 
355 360 365 

ate atg atg cag agg ggc aac ttc agg aac cag agg aag aca gtg aag 1152 
He Net Met Gin Arg Gly Aen Phe Arg Asn Gin Arg Lys Thr Val Lys 
370 375 380 

tgc ttc aac tgt ggc aag gtg ggc cac att gcc aag aac tgt agg gcc 1200 
Cys Phe Asn Cys Gly Lys Val Gly His He Ala Lys Asn Cys Arg Ala 
385* 390 395 400 

ccc agg aag aag ggc tgc tgg aag tgt ggc aag gag ggc cac cag atg 1248 
Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly His Gin Met 
405 410 415 

aag gac tgc aat gag agg cag gcc aac ttc ctg ggc aaa ate tgg ccc 1296 
Lys Asp Cys Asn Glu Arg Gin Ala Asn Phe Leu Gly Lys He Trp Pro 
420 425 430 

tec cac aag ggc agg cct ggc aac ttc etc cag tec agg cct gag ccc 1344 
Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gin Ser Arg Pro Glu Pro 
435 440 445 

aca gcc cct ccc gag gag tec ttc agg ttt ggg gag gag aag acc acc 1392 
Thr Ala Pro Pro Glu Glu Ser Phe Arg Phe Gly Glu Glu Lys Thr Thr 
450 455 460 

ccc age cag aag cag gag ccc att gac aag gag ctg tac ccc ctg gcc 1440 
Pro Ser Gin Lys Gin Glu Pro He Asp Lys Glu Leu Tyr Pro Leu Ala 
465 470 475 480 

tec ctg agg tec ctg ttt ggc aac gac ccc tec tec cag tea <5ID M0:36) 1482 
Ser Leu Arg Ser Leu Phe Gly Asn Asp Pro Ser Ser Gin * (SID NO: 37) 
465 490 
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FIGURE 33A 

ATGGGTGCTA GGGCTTCTGT GCTGTCTGGT GGTGAGCTGG ACAAGTGGGA GAAGATCAGG 
CTGAGGCCTG GTGGCAAGAA GAAGTACAAG CTAAAGCACA TTGTGTGGGC CTCCAGGGAG 
CT6GAGAGGT TTGCT6TGAA CCCT6GCCTG CTGGAGACCT CTGAG6GGTG CAGGCAGATC 
CTGGGCCAGC TCCAGCCCTC CCTGCAAACA GGCTCTGAGG AGCTGAGGTC CCTGTACAAC 
ACAGTGGCTA CCCTGTACTG TGTGCACCAG AAGATTGATG TGAAGGACAC CAAGGAGGCC 
CTGGAGAAGA TTGAGGAGGA GCAGAACAAG TCCAAGAAGA AGGCCCAGCA GGCTGCTGCT 
GGCACAGGCA ACTCCAGCCA G6TGTCCCA6 AACTACCCCA TTGTGCAGAA CCTCCA6GGC 
CAGATGGTGC ACCAGGCCAT CTCCCCCCGG ACCCTGAATG CCTGGGTGAA GGTGGTGGAG 
GAGAAGGCCT TCTCCCCT6A GGTGATCCCC ATGTTCTCTG CCCTGTCTGA GGGTGCCACC 
CCCCAGGACC TGAACACCAT GCTGAACACA GTG6GGG6CC ATCAG6CTGC CATGCAGATG 
CTGAA6GAGA CCATCAATGA GGAGGCTGCT GA6T66GACA GGCTGCATCC TGTGCACGCT 
GGCCCCATTG CCCCCGGCCA GATGAGGGAG CCCAGGGGCT CTGACATTGC TGGCACCACC 
TCCACCCTCC AGGAGCAGAT TGGCTGGATG ACCAACAACC CCCCCATCCC TGTGGGGGAA 
ATCTACAAGA GGTGGATCAT CCTGGGCCTG AACAAGATTG TGAGGATGTA CTCCCCCACC 
TCCATCCTGG ACATCAGGCA GGGCCCCAAG GAGCCCTTCA GGGACTATGT GGACAGGTTC 
TACAAGACCC TGAGGGCTGA GCAGGCCTCC CAGGAGGTGA AGAACTGGAT GACAGAGACC 
CTGCTGGTGC AGAAT6CCAA CCCTGACTGC AAGACCATCC TGAAGGCCCT GGGCCCTGCT 
GCCACCCTGG AG6AGATGAT 6ACAGCCTGC CAGGGGGTGG GGGGCCCTGG TCACAAGGCC 
AGGGTGCTGG CTGAGGCCAT GTCCCAGGTG ACCAACTCCG CCACCATCAT GATGCAGAGG 
GGCAACTTCA GGAACCAGAG GAAGACAGTG AAGTGCTTCA ACTGTGGCAA GGTGGGCCAC 
ATX-v^CCAAGA ACTGTAG6GC CCCCAGGAAG -AAGGGCTGCT 6GAAGTGTGG CAAGGAGGGC 
CACCAGATGA AGGACTGCAA TGAGAGGCAG GCCAACTTCC TGGGCAAAAT CTGGCCCTCC 
CACAAGGGCA GGCCTGGCAA CTTCCTCCAG TCCAGGCCTG AGCCCACAGC CCCTCCCGAG 
GAGTCCTTCA GGTTTGGGGA GGAGAAGACC ACCCCCAGCC AGAAGCAGGA GCCCATTGAC 
AAGGAGCTGT ACCCCCTGGC CTCCCTGAGG TCCCTGTTTG GCAACGACCC CTCCTCCCAG 
ATGGCTCCCA TCTCCCCCAT TGAGACTGTG CCTGTGAAGC TGAAGCCT6G CATGGATGGC 
CCCAAGGTGA AGCAGTGGCC CCTGACTGAG GAGAAGATCA AGGCCCTGGT GGAAATCTGC 
ACTGAGATGG AGAAGGA6GG CAAAATCTCC AAGATTG6CC CCGAGAACCC CTACAACACC 
CCTGT6TTTG CCATCAAGAA GAAGGACTCC ACCAAGTGGA GGAAGCTGGT GGACTTCAGG 
GAGCTGAACA AGAGGACCCA GGACTTCTGG GAGGTGCAGC TGGGCATCCC CCACCCCGCT 
6GCCTGAAGA A6AAGAAGTC TGTGACTGTG CTGGCTGTGG GGGATGCCTA CTTCTCTGTG 
CCCCTGGATG AGGACTTCAG GAAGTACACT GCCTTCACCA TCCCCTCCAT CAACAATGAG 
ACCCCTGGCA TCAGGTACCA GTACAATGTG CTGCCCCAGG GCTGGAAGGG CTCCCCTGCC 
ATCTTCCAGT CCTCCATGAC CAAGATCCTG GAGCCCTTCA GGAAGCAGAA CCCTGACATT 
GTGATCTACC AGTACATGGC TGCCCTGTAT GTGGGCTCTG ACCTGGAGAT TGGGCAGCAC 
AGGACCAAGA TT6AGGAGCT GAGGCAGCAC CTGCTGAGGT GGGGCCTGAC CACCCCTGAC 
AAGAAGCACC AGAAGGAGCC CCCCTTCCTG TGGATGGGCT ATGAGCTGCA CCCCGACAAG 
T6GACTGTGC AGCCCATTGT GCT6CCTGAG AAGGACTCCT GGACTGTGAA TGACATCCAG 
AAGCTGGTGG GCAAGCT6AA CTGGGCCTCC CAAATCTACC CT6GCATCAA GGTGAGGCAG 
CTGTGCAAGC TGCTGAGGGG CACCAAGGCC CTGACTGAGG TGATCCCCCT GACTGAGGAG 
6CT6AGCTGG AGCTGGCTGA GAACAGGGAG ATCCTGAAGG AGCCTGTGCA TGGGGTGTAC 
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FIGURE 33B 

TATGACCCCT CCAAGGACCT GATTGCTGAG ATCCAGAAGC AGGGCCAGGG CCAGTGGACC 
TACCAAATCT ACCAGGAGCC CTTCAAGAAC CTGAAGACTG GCAAGTATGC CAGGATGAGG 
GG6GCCCACA CCAATGATGT GAAGCAGCTG ACTGAGGCTG TGCAGAAGAT CACCACTGAG 
TCCATT6T6A TCTGGGGCAA GACCCCCAAG TTCAAGCTGC CCATCCAGAA GGAGACCTGG 
GAGACCTGGT GGACTGAGTA CTGGCAGGCC ACCTG6ATCC CTGAGTGGGA GTTTGTGAAC 
ACCCCCCCCC TGGTGAAGCT GTGGTACCAG. CTGGAGAAGG AGCCCATTGT GGGG6CTGAG 
ACCTTCTATG TGGCTGGGGC TGCCAACAGG GAGACCAAGC TGGGCAAGGC TGGCTATGT6 
ACCAACAGGG GCAGGCA6AA GGTGGTGACC CTGACTGACA CCACCAACCA GA^iGACTGCC 
CTCCAGGCCA TCTACCTGGC CCTCCAGGAC TCTGGCCTGG AGGTGAACAT TGT6ACTGCC 
TCCCAGTAT6 CCCTGGGCAT CATCCAGGCC CAGCCTGATC AGTCTGAGTC TGAGCTGGTG 
AACCAGATCA TTGAGCAGCT GATCAAGAAG GA6AAGGTGT ACCTGGCCTG GGTGCCTGCC 
CACAAGGGCA TTGGGGGCAA TGAGCAGGTG GACAAGCTGG TGTCTGCTGG CATCAG6AAG 
GTGCTGTTCC TGGATGGCAT TGACAAGGCC CAGGATGAGG ATGAGAAGTA CCACTCCAAC 
TGGAGGGCTA TGGCCTCTGA CTTCAACCTG CCCCCTGTG6 TGGCTAAGGA GATTGTGGCC 
TCCTGTGACA AGTGCCAGCT GAAGGGGGAG GCCATGCATG GGCAGGTGGA CTGCTCCCCT 
GGCATCTGGC AGCTGGCCTG CACCCACCTG GAGGGCAAGG TGATCCTGGT GGCTGTGCAT 
GTGGCCTCCG GCTACATTGA GGCTGAGGTG ATCCCTGCTG AGACAGGCCA GGAGACTGCC 
TACTTCCTGC TGAAGCTGGC TGGCAGGTGG CCTGTGAAGA CCATCCACAC TGCCAATGGC 
TCCAACTTCA CTGGGGCCAC AGTGAGGGCT GCCTGCTGGT GGGCTGGCAT CAAGCAGGAG 
TTTCGCATCC CCTACAACCC CCAGTCCCAG GGGGTGGTGG CCTCCATGAA CAAGGAGCTG 
AAGAAGATCA TTGGGCAGGT GAGGGACCAG GCTGAGCACC TGAAGACAGC TGTGCAGATG 
GCTGTGTTCA TCCACAACTT CAAGAGGAAG GGGGGCATCG GGGGCTACTC CGCTGGGGAG 
AGGATTGTGG ACATCATTGC CACAGACATC CAGACCAAGG AGCTCCAGAA GCAGATCACC 
AAGATCCAGA ACTTCAGGGT GTACTACAGG GACTCCAGGA ACCCCCTGTG GAAGGGCCCT 
GCCAAGCT6C TGTGGAAGGG GGAGGGGGCT 6TGGTGATCC AGGACAACTC TGACATCAAG 
GTGGTGCCCA GGAGGAAGGC CAAGATCATC AGGGACTATG GCAAGCAGAT GGCTGGGGAT 
GACTGTGTGG CCTCCAGGCA GGATGAGGAC TAA 
SEQ ID NO: 38 
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FIGURE a4A 
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FIGURE 34B 

Asp Lys Trp Thr Val Gin Pro lie Val Leu Pro Glai Lys Asp Ser Trp Thr Val 
Asn Asp lie Gin Lys Leu Val Gly Lys Leu Asn Trp Ala Ser Gin lie Tyr Pro 
Gly He Lys Val Arg Gin Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr 
Glu Val He Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu 
He Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu He 
Ala Glu He Gin Lys Gin Gly Gin Gly Gin Trp Thr Tyr Gin He Tyr Gin Glu 
Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met Arg Gly Ala His Thr 
Asn Asp Val Lys Gin Leu Thr Glu Ala Val Gin Lys He Thr Thr Glu Ser He 
Val He Trp Gly Lys Thr Pro Lys Phe Lys Leu Pro He Gin Lys Glu Thr Trp 
Glu Thr Trp Trp Thr Glu Tyr Trp Gin Ala Thr Trp He Pro Glu Trp Glu Phe 
Val Asn Thr Pro Pro Leu Val Lys Leu Trp Tyr Gin Leu Glu Lys Glu Pro He 
Val Gly Ala Glu Thr Phe Tyr Val Ala Gly Ala Ala Asn Arg Glu Thr Lys Leu 
Gly Lys Ala Gly Tyr Val Thr Asn Arg Gly Arg Gin Lys Val Val Thr Leu Thr 
Asp Thr Thr Asn Gin Lys Thr Ala Leu Gin Ala He Tyr Leu Ala Leu Gin Asp 
Ser Gly Leu Glu Val Asn He Val Thr Ala Ser Gin Tyr Ala Leu Gly He He 
Gin Ala Gin Pro Asp Gin Ser Glu Ser Glu Leu Val Asn Gin He He Glu Gin 
Leu He Lys Lys Glu Lys Val Tyr Leu Ala Trp Val Pro Ala His Lys Gly He 
Gly Gly Asn Glu Gin Val Asp Lys Leu Val Ser Ala Gly He Arg Lys Val Leu 
Phe Leu Asp Gly He Asp Lys Ala Gin Asp Glu His Glu Lys Tyr His Ser Asn 
Trp Arg Ala Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu He 
Vaj. Ala Ser Cys Asp Lys Cys Gin Leu Lys Gly Glu Ala Met His Gly Gin Val 
Asp Cys Ser Pro Gly He Trp Gin Leu Ala Cys Thr His Leu Glu Gly Lys Val 
He Leu Val Ala Val His Val Ala Ser Gly Tyr He - Glu Ala Glu Val He Pro 
Ala Glu Thr Gly Gin Glu Thr Ala Tyr Phe Leu Leu Lys Leu Ala Gly Arg Trp 
Pro Val Lys Thr He His Thr Ala Asn Gly Ser Asn Phe' Thr Gly Ala Thr Val 
Arg Ala Ala Cys Trp Trp Ala Gly He Lys Gin Glu Phe Gly He Pro Tyr Asn 
Pro Gin Ser Gin Gly Val Val Ala Ser Met Asn Lys Glu Leu Lys Lys He He 
Gly Gin Val Arg Asp Gin Ala Glu His Leu Lys Thr Ala Val Gin Met Ala Val 
Phe He His Asn Phe Lys Arg Lys Gly Gly Jle Gly Gly Tyr Ser Ala Gly Glu 
Arg He Val Asp He He Ala Thr Asp He Gin Thr Lys Glu Leu Gin Lys Gin 
He Thr Lys He Gin Asn Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asn Pro Z#eu 
Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val He Gin 
Asp Asn Ser Asp He Lys Val Val Pro Arg Arg Lys Ala Lys He He Arg Asp 
Tyr Gly Lys Gin Met Ala Gly Asp Asp Cys Val Ala Ser Arg Gin Asp Glu Asp 
SEQ ID NO: 39 
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1 


t-3,»-ll, 
13-18.29, 
30,31.32, 
34. 35. 37 


The claims are directed to an adenoviral vector tbat is at least partially deleted of 
AEl. the vector «^*««^ the cis-acting packaging wMpifncg of the wild type 
adenovirus genome, and a gene which encodes an HIV Gas protein (SEO ID NO: 29) 
inserted in the parallel arientadon of El. In addition die vector contains a nromoter 
and a polyadenylation sigud. 


2 


6. 7. 36 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl and AE3. die vector contains the cis-acting packaging sequence of the wild type 


adenovirus genome, and a gene M^ch encodes an HIV Gag protein (SEQ ID NO: 
29). 


3 


12, 33 


The claims are (firected lo an adenoviral vector that is at least partially deleted of 
AEl . (be vector contains the cis-acting packaging sequence of die wild type 
adenovirus genome, and a gene which encodes an HIV protein inserted in the 
antiparallel orientation of El . 


4 


19-23. 3^.42 


The claims are directed to a method of imlring and harvesting of a recombinant 
adenoviral particle that contains a gene encocting an HIV Car protein. 




1A. 97 7a 
A/. XO| 

43,46,47 


The claim is directed to a method of generating a cellular mediated immune response 
to HIV Gag protein widi the recombinant adenoviral particle. 


6 


25,26.44. 
45 


The claim is directed to a method of generating a ceUular mediated iintfiiine re^Mose 
to HIV Gag protein with the recombinant adeno^al particle in addition lo 
administering a DNA plasmid vaccine. 


7 


48-51.53. 
54.56 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl, the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Pol protein (SEO ID NO: 1) 
inserted in the parallel orientation of El . 


8 


48-51.53. 
54.56 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl. the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Pol protein (SEQ ID NO: 5) 
inserted in the parallel orientation of El . 


9 


48-51,53. 
54.56 


The claims are directed u> an adenoviral vector that is at least partially deleted of 
AEl, die vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene vAnch encodes an HIV Pol protein (SEO ID NO: 7) 
inserted in the parallel orientation of El. 


10 


52 


The claim is directed to an adenoviral vector that is at least partially deleted of A£ 1 , 
die vector contains the cis-actii^ packaging sequence of the wild type adenovirus 
genome, and a gene which encodes an HIV Pol protein (SEO ID NO: 1) inserted in 
the antiparallel orientation of E 1 . 


11 


52 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl . 
the vector contains the cts-acting packaging sequence of the wild type adenovirus 
genome, and a gene wiiich encodes an HIV Pol protein (SEO ID NO: 5) inserted in 
(he antiparallel orientation of El . 


12 


52 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl , 
the vector contains the cis*acting packaging sequence of the wild type adenovirus 
genome, and a gene which encodes an HIV Pol protein (SEQ ID NO: 7) inserted in 
die aniiparallcl orieniadon of E 1 . 


13 


55 ^ 


The claim ts directed to an adenoviral vector that is at least partially deleted of AEl 
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and AE3 , (he vector contains the cis-aciing packaging sequence of the wild type 
adenovirus eenome. and a aene wAiich encodes an HIV Pol protein (SEQ ID NO: t) 


14 


55 


The claim is ejected to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vector contains the cis-acting packaging sequence of the wild type 
adenovirus eenome. and a sene which encodes an HIV Pol protein (SEO ID NO: 5) 

ificj^gf in t* t 
lIi>C|lCO UI C>1. 


IS 


55 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vector contains the cis-acting packaging sequence of the wild type 
adenovirus eenome. and a sene which encodes an HIV Pol protein (SEO ID NO: 7) 
inserted in E I . 


16 


57-61 


The claims are directed to a method of making and harvesting of a recambinant 
adenoviral particle that contains a gene encoding an HIV Pol protein. 


17 


62. 65, 66 


Tte claim is directed to a method of generating a cellular mediated inmune respoose 
to HTV Pd protein with ths recombmant adenoviral particle. 


18 


63.64 


The claim is directed to a method of generating a cellular mediated immune response 
to HIV Pol nrotein with the recombinant adenoviral particle in addition to 
administering a DNA plasnnd vaccine. 


19 


67-70, 72. 
73,75 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl, the vector contains the cis-aciii^ packaging sequence of the wild type 
adenovirus aenome. and a eene which encodes an HIV Nef protein <SEQ ID NO: 9) 
inserted in the parallel orieniaUon of El. 


20 


61 '70, 72, 
73,75 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl. the vector contains the cis-acting packaging sequence of the wild type 
adenovirus eenome. and a sene which encodes an HTV Nef protein (SEO ID NO: 1 1) 
inserted in Che paiaUel orieniatian of El. 


21 


67-70, 72, 
73.75 


1 DC Ciauns are Ouccteo w an aacnovirai vecior mai is ai icasi paruauy wcjctjcu di 
AEl, the vector contains the cis-acting packaging sequence of the wild type 
adenovirus senonie. and a Rene ^ch encodes an HIV Nef protein (SEQ ID NO: 13) 
inserted in the parallel orientation of EI. 


22 


67-70. 72. 
73.75 


The claims are directed to an adeooraal vector chat is at least partially deleted of 
AEl. the vector contains the cis-ectmg packaging sequence of the wild type 
adenovirus eenomB. and a eene which encodes an HIV Nef protein (SEO ID NO: 15) 
inserted in the parallel crientadon of £ 1 . 


23 


71 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl. 
the vector contains the cis-acttifg packaging sequence of Che wild type adenovirus 
eenome. and a eene wiiich encodes an HIV Nef protein (SEQ ID NO: 9) inserted in 
the amiparallel orientation of E 1 . 


24 


71 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl, 
the vector contains the cis-acting packaging sequence of die wild type adenovirus 
eenonK. and a sewwAiich encodes an HIV Nef protein (SEO ID NO: 1 1) inserted in 
the antiparallel orientation of El . 


75 


71 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl. 
the vector contains the cis-acting packaging sequence of the wQd type adenovirus 

o^ffmffw an<1 n o^TV* u/hich «icnd^ fin HIV Nef iumtetti fSEO ID NO* 1 3) inserted in 

the antiparallel orientation of E 1 . 


26 


71 


The claim is directed Co an adenoviral vector that is at least partially deleted of AEl. 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
eenoiK. and a eene which encodes an HIV Nef protem (SEQ ID NO: 15) inserted in 
the antiparallel orientation of El . 


27 


74 


Tte claim is (tirected to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vector contains die cis-acting packaging sequence of the wild ^pe 
adenovirus eenome. and a Rene which encodes an HIV Nef protein (SEQ ID NO: 9) 
inserted in El . 


28 


74 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 . the vector contains the cis-acting packaging sequence of the wild type 
adenovirus eenome. and a eene vtAiich encodes an HIV Nef protein (SEQ ID NO: U) 
inserted in El. 


29 


74 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 . the vector contains the cis-acting packaging sequence of the wild type 



Form PCT/I5A/210 (second sheet) (July 1998) 



I>rrERNATIONAL SEARCH REPORT 



[nieniatioiial application No. 
PCT/US01/2886I 







adenovirus genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 13) 
inserted in El. 


30 


74 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 • the vector contains the cis-aciing packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef proiein (SEQ ID NO: 15) 
inserted in El. 


31 


76-80 


The claims are t&ecled to a method of and harvesting of a recombinant 
adenoviral particle that contains a gov encoding; an HIV Nef protein. 


32 


81.84,83 


The claims arc direcled to a method of generating a cellular mediated imm»n«» 
response to HIV Nef with the recombinant adenoviral particle. 


33 


82,83 


The claims are directed to a method of generating a cellular mediated imnmna 
response to HIV Nef with the recombinant adenoviral particle in addition to 
administering a DNA ptasmid vaccine. 


34 


86a 


The claim is drawn to a muItivBlem vaccine wherein gag, pol and Ji^are ejqpiessed 
from three individual vectcns. 


35 


86b, 88, 89 


The claims are drawn to a multivalent vaccine wherein g^g, pet and iv/are expressed 
from one individual vectors. 


36 


86c, 88 


The claims are drawn to a multivalent vaccine v^ierein gag, poi and nefaie expressed 
from two individual vectors, one expressing nef-pol fusion and one expressing gag. 


37 


86d, 87. 88 


The claims axe drawn to a multivalent vaccme wherein.gog,* pol and nef aie expressed 
from two individual voctbfs*. one expressing gag-pol fusion and one expressing nef. 


38 


86e, 88 


The claims are drawn to a multivalent vaccine v^ierem gag, pol and nef aire expressed 
from two individual vectors, one expressing nef'gag fusion and one expressing pol. 


39 


86f.88 


The claims are drawn to a multivalent vaccine wherein gag, pol and n£fm expressed 

from a single vectors as a fusion protein. 


40 


86g, 88 


The claims are drawn to a multivalent vaccine wherein gag and pol are expressed 

from two individual vectors. 


41 


86h. 88. 89 


The claims are drawn to a multivalent vaccine wherein gag and pol are expressed 

individually from one vector. 


42 


86i.8S 


The claims are drawn to a multivalent vaccine v^iereui pol and nef en eiqnessed 

from two individual vectors. 


43 


86j, 88. 89 


The claims are drawn to a multivalent vaccine wherein pol and nef art expressed 

from individually from one vector. 


44 


86k. 88 


The claims arc drawn to a multivalent vaccine wherein nefassA gag are expressed 

individually from one vector. 


45 


861. 88, 89 


The claims are drawn to a multivalent vaccine wherein nef and gag are expressed 

individually from one vector. 


46 


86m. 88 


The claims are drawn to a multivalent vaccine wherein gag and pol are expressed as a 

fusion protein from one vector. 


47 


86eu88 


The claims are drawn to a multivalent vaccine wherein /w/ and nef are expressed as a 

fustcm protein from one vector. 


48 


860.88 


The claims are drawn to a multivalent vaccine wherein nef and gag are expressed as a 
fusion protein from one vector. 



The inventions listed as Groups 1-48 do not relate to a single general inveniive GQtKq>t under PCT Rule 13.1 because, under PCT 
Rule 13.2. they lack the same or corresponding special technical features for Ihe following reasons: 

The technical feature linking groups 1-33 appears to be a recombinant adenoviral vector wherein the adenoviral vector is at least • 
partially deleted in £1 but the vector may contain more deletions, die vector contains ^d type sequences itichT^g pagkagifig signals 
and a gene encoding a heterologous HIV protein or fragments thereof. Ertl et al. (WO 96/39178) disclose a recombinant adenoviral 
vector that is deleted in El and partially deleted in E3, Ihe remainder of the adenoviral vector contahis wild type sequeiKes. The 
vector additicmally contains an insertion of a heterologous protein which inchicfes HIV proteins (see abstract and claims 1 and 5). 
Therefore, the technical feamre linking the inventions of groups 1-4S does not constitute a speciaJ technical feature as dermed by PCT 
Rule 1 3.2, as it does not deflne a connibution over the prior art 

The q>ecial technical feaoire of the following groups 1-3,7-15. 19-30 and 34-48 is considered to be the comlnnation of sequexves that 
is disclosed in each ffwp, see individual claim groupings above for the different sequences. The DNA disclosed in each group is 
made up of a different sequence having a different structure and different function. 
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The special technical feature of group 4, 16 and 3 1 is considered to be a method of producing recombinant adenoviral particles. Each 
group contains diflerent sequences hence the resulting panicles would have difTerent stiuciures and functions associated with the 
particle. 

The special technical feature of group 5, 17 and 32 is considered to be a method of producing a cellular mediaied imrrajng response to 
Ihe heterologous protein encoded by the different adenoviral vectors. Each group contains Afferent sequences a encodii^ different 
protein, therefore the resulting immune response will also be different. 

The special technical feature of group 6. 18 and 33 is considered to be a method of producing a cellular niediated ixmiune response to 
the heterologous protein encoded by the different adenoviral vectors in conjunction with ttmnimiying the individual a DNA plasmid 
vaccine. Each method contains difTetent sequences encoding a different protein, therefore the resuldng immune ttspcmt will also be 
different. 

Accordingly, groups 1-48 are not so linked by the same or corresponding technical feature as to form a single general inventive 
concept. . 



Continuation of B. FIELDS SEARCHED Item 3: 
WEST 2.0, STN-BIOSIS. MEDLINE 

adenovirai vector, deletion, HIV, Gag, polyadenylation signal, CMV promoter 
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